You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I've been looking to do a large-scale comparison of (m)any MT model that has to do with Dutch (xx->NL, NL->XX) with all the test sets that I can find. The OPUS leaderboard is a great starting point for me. In a first step, I would like to reproduce the scores in the OPUS leaderboard. For reproducibiliy sake it would therefore be useful if there is an overview of some meta information on the benchmarks:
metric parameters used (e.g. n-gram size for BLEU, model for COMET, etc.)
generation parameters for the models (num beams, sampling (topk/topp/temperature)?
Hello
I've been looking to do a large-scale comparison of (m)any MT model that has to do with Dutch (xx->NL, NL->XX) with all the test sets that I can find. The OPUS leaderboard is a great starting point for me. In a first step, I would like to reproduce the scores in the OPUS leaderboard. For reproducibiliy sake it would therefore be useful if there is an overview of some meta information on the benchmarks:
pip freeze
If you can share any info about this, I'd be grateful!
Bram
The text was updated successfully, but these errors were encountered: