You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Hello. First off, thanks for conducting such extensive evaluations on all of these models. I am finding it very useful for checking my own results. However, when looking into your evaluation files, I've noticed the following:
furthermore, when running sacrebleu on the model output files for FLoRes200, I get different results. It is likely that these are duplicates. Maybe the flores101.output was evaluated twice?
The text was updated successfully, but these errors were encountered:
Hello. First off, thanks for conducting such extensive evaluations on all of these models. I am finding it very useful for checking my own results. However, when looking into your evaluation files, I've noticed the following:
nllb-200-distilled-1.3B/flores101-devtest.eng-deu.eval
:is exactly the same as
nllb-200-distilled-1.3B/flores200-devtest.eng-deu.eval
:furthermore, when running sacrebleu on the model output files for FLoRes200, I get different results. It is likely that these are duplicates. Maybe the flores101.output was evaluated twice?
The text was updated successfully, but these errors were encountered: