-
Notifications
You must be signed in to change notification settings - Fork 24
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Major Cleanup #1 #252
Major Cleanup #1 #252
Conversation
these were inherited from decaNLP but the implementations are old and no longer used. We can import similar metrics from HF/datasets library.
ff40c58
to
83cbff2
Compare
83cbff2
to
5ccaa1a
Compare
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks, overall is a good cleanup, but you should make a few changes before merging it. Please see the individual comments.
Also, have you tested this code with old non-multilingual models? What about the genie-k8s scripts? Especially since you have removed training/prediction arguments.
3df2253
to
8833a1b
Compare
aa3210a
to
91c3031
Compare
This pull request fixes 3 alerts when merging 91c3031 into a5089ae - view on LGTM.com fixed alerts:
|
A cleanup for genienlp has been long overdue!
We've collected quite some technical debt which we need to address. We have created a project dashboard to keep track of issues we want to address. I've moved the cards this PR addresses to "In progress".
The goal of this series of cleanups is to simplify the code, make it more readable, easier to modify, and more inviting for outside collaborators.
There are no functional changes (besides dropping almond_multilingual tasks and associated code) - tests are still passing.
We're dropping support for old generic datasets and associated metrics which were inherited from decaNLP code.
Those implementations are obsolete and should be replaced with what is now accessible through datasets library. Please see issue. If you want support for a new dataset, feel free to submit a PR for it.