Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

First shot at proper transfer learning #38

Open
zouharvi opened this issue Mar 2, 2020 · 5 comments
Open

First shot at proper transfer learning #38

zouharvi opened this issue Mar 2, 2020 · 5 comments
Assignees
Labels

Comments

@zouharvi
Copy link
Owner

zouharvi commented Mar 2, 2020

Train at opus.yaml, then copy trained model and run a few epochs on base.yaml.

@pixelneo
Copy link
Collaborator

pixelneo commented Mar 6, 2020

By base.yaml, you mean wmt19.yaml in current naming?

@zouharvi
Copy link
Owner Author

zouharvi commented Mar 6, 2020

Yes.

@pixelneo
Copy link
Collaborator

pixelneo commented Mar 15, 2020

ok, transfer learning is done: make tqt runs it, results are still bad.
branch transfer

@zouharvi
Copy link
Owner Author

Ok, there must be something wrong. F1 7% is way under the constant classifier score.

@zouharvi zouharvi reopened this Mar 16, 2020
@pixelneo
Copy link
Collaborator

So, I've tried:

  • modifying learning rate and number of epochs for both models
  • train on wmt, save, load, train again
  • switch quetch for nuqe.

None of this helps. When using nuqe the drop in F1 is not that significant but stil considerable.
Also, I've noticed, when loading a saved model, following info is printed:
[kiwi.data.utils load_vocabularies_to_fields:126] Loaded vocabularies from runs/quetch/best_model.torch

which seems a little weird since I am not loading vocabularies.
I've tried to also explicitly load vocab from the vocab.torch file (and run again several settings - lr, epochs, ...) This did not help.

So basically, I don't know where to go from here other than searching for a bug in openkiwi.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

No branches or pull requests

2 participants