- dataset: opus
- model: transformer
- source language(s): est fin fkv_Latn hun izh krl liv_Latn vep vro
- target language(s): est fin fkv_Latn hun izh krl liv_Latn vep vro
- model: transformer
- pre-processing: normalization + SentencePiece (spm32k,spm32k)
- a sentence initial language token is required in the form of
>>id<<
(id = valid target language ID) - download: opus-2020-07-26.zip
- test set translations: opus-2020-07-26.test.txt
- test set scores: opus-2020-07-26.eval.txt
testset | BLEU | chr-F |
---|---|---|
Tatoeba-test.est-est.est.est | 2.0 | 0.252 |
Tatoeba-test.est-fin.est.fin | 51.0 | 0.704 |
Tatoeba-test.est-fkv.est.fkv | 1.1 | 0.211 |
Tatoeba-test.est-vep.est.vep | 3.1 | 0.272 |
Tatoeba-test.fin-est.fin.est | 55.2 | 0.722 |
Tatoeba-test.fin-fkv.fin.fkv | 1.6 | 0.207 |
Tatoeba-test.fin-hun.fin.hun | 42.4 | 0.663 |
Tatoeba-test.fin-izh.fin.izh | 12.9 | 0.509 |
Tatoeba-test.fin-krl.fin.krl | 4.6 | 0.292 |
Tatoeba-test.fkv-est.fkv.est | 2.4 | 0.148 |
Tatoeba-test.fkv-fin.fkv.fin | 15.1 | 0.427 |
Tatoeba-test.fkv-liv.fkv.liv | 1.2 | 0.261 |
Tatoeba-test.fkv-vep.fkv.vep | 1.2 | 0.233 |
Tatoeba-test.hun-fin.hun.fin | 47.8 | 0.681 |
Tatoeba-test.izh-fin.izh.fin | 24.0 | 0.615 |
Tatoeba-test.izh-krl.izh.krl | 1.8 | 0.114 |
Tatoeba-test.krl-fin.krl.fin | 13.6 | 0.407 |
Tatoeba-test.krl-izh.krl.izh | 2.7 | 0.096 |
Tatoeba-test.liv-fkv.liv.fkv | 1.2 | 0.164 |
Tatoeba-test.liv-vep.liv.vep | 3.4 | 0.181 |
Tatoeba-test.multi.multi | 36.7 | 0.581 |
Tatoeba-test.vep-est.vep.est | 3.4 | 0.251 |
Tatoeba-test.vep-fkv.vep.fkv | 1.2 | 0.215 |
Tatoeba-test.vep-liv.vep.liv | 3.4 | 0.179 |
- dataset: opus
- model: transformer
- source language(s): eng est fin fkv_Latn hun izh kom kpv krl liv_Latn mdf mhr mrj myv sma sme udm vep vro
- target language(s): eng est fin fkv_Latn hun izh kom kpv krl liv_Latn mdf mhr mrj myv sma sme udm vep vro
- model: transformer
- pre-processing: normalization + SentencePiece (spm32k,spm32k)
- a sentence initial language token is required in the form of
>>id<<
(id = valid target language ID) - download: opus-2020-10-04.zip
- test set translations: opus-2020-10-04.test.txt
- test set scores: opus-2020-10-04.eval.txt
testset | BLEU | chr-F |
---|---|---|
newsdev2015-enfi-engfin.eng.fin | 16.6 | 0.500 |
newsdev2015-enfi-fineng.fin.eng | 21.2 | 0.499 |
newsdev2018-enet-engest.eng.est | 17.9 | 0.500 |
newsdev2018-enet-esteng.est.eng | 24.5 | 0.524 |
newssyscomb2009-enghun.eng.hun | 14.1 | 0.459 |
newssyscomb2009-huneng.hun.eng | 18.5 | 0.476 |
newstest2009-enghun.eng.hun | 14.1 | 0.448 |
newstest2009-huneng.hun.eng | 18.0 | 0.472 |
newstest2015-enfi-engfin.eng.fin | 18.2 | 0.513 |
newstest2015-enfi-fineng.fin.eng | 22.4 | 0.504 |
newstest2016-enfi-engfin.eng.fin | 19.1 | 0.523 |
newstest2016-enfi-fineng.fin.eng | 24.1 | 0.529 |
newstest2017-enfi-engfin.eng.fin | 21.7 | 0.545 |
newstest2017-enfi-fineng.fin.eng | 26.3 | 0.542 |
newstest2018-enet-engest.eng.est | 18.6 | 0.511 |
newstest2018-enet-esteng.est.eng | 24.8 | 0.533 |
newstest2018-enfi-engfin.eng.fin | 14.3 | 0.481 |
newstest2018-enfi-fineng.fin.eng | 19.5 | 0.476 |
newstest2019-enfi-engfin.eng.fin | 18.6 | 0.502 |
newstest2019-fien-fineng.fin.eng | 23.6 | 0.517 |
newstestB2016-enfi-engfin.eng.fin | 15.5 | 0.493 |
newstestB2016-enfi-fineng.fin.eng | 20.1 | 0.490 |
newstestB2017-enfi-engfin.eng.fin | 17.8 | 0.511 |
newstestB2017-enfi-fineng.fin.eng | 22.6 | 0.511 |
newstestB2017-fien-fineng.fin.eng | 22.6 | 0.511 |
Tatoeba-test.chm-eng.chm.eng | 2.3 | 0.169 |
Tatoeba-test.eng-chm.eng.chm | 1.6 | 0.149 |
Tatoeba-test.eng-est.eng.est | 47.6 | 0.673 |
Tatoeba-test.eng-fin.eng.fin | 32.9 | 0.579 |
Tatoeba-test.eng-fkv.eng.fkv | 1.6 | 0.175 |
Tatoeba-test.eng-hun.eng.hun | 32.8 | 0.567 |
Tatoeba-test.eng-izh.eng.izh | 2.6 | 0.048 |
Tatoeba-test.eng-kom.eng.kom | 1.2 | 0.011 |
Tatoeba-test.eng-krl.eng.krl | 2.8 | 0.116 |
Tatoeba-test.eng-liv.eng.liv | 2.1 | 0.058 |
Tatoeba-test.eng-mdf.eng.mdf | 2.3 | 0.008 |
Tatoeba-test.eng-myv.eng.myv | 0.9 | 0.013 |
Tatoeba-test.eng-sma.eng.sma | 1.2 | 0.082 |
Tatoeba-test.eng-sme.eng.sme | 12.6 | 0.243 |
Tatoeba-test.eng-udm.eng.udm | 6.5 | 0.169 |
Tatoeba-test.est-eng.est.eng | 53.6 | 0.694 |
Tatoeba-test.est-est.est.est | 2.9 | 0.310 |
Tatoeba-test.est-fin.est.fin | 51.8 | 0.703 |
Tatoeba-test.est-fkv.est.fkv | 1.2 | 0.180 |
Tatoeba-test.est-vep.est.vep | 3.4 | 0.201 |
Tatoeba-test.fin-eng.fin.eng | 46.2 | 0.643 |
Tatoeba-test.fin-est.fin.est | 55.3 | 0.711 |
Tatoeba-test.fin-fkv.fin.fkv | 1.6 | 0.161 |
Tatoeba-test.fin-hun.fin.hun | 44.3 | 0.671 |
Tatoeba-test.fin-izh.fin.izh | 6.8 | 0.296 |
Tatoeba-test.fin-krl.fin.krl | 3.5 | 0.218 |
Tatoeba-test.fkv-eng.fkv.eng | 14.6 | 0.411 |
Tatoeba-test.fkv-est.fkv.est | 10.1 | 0.365 |
Tatoeba-test.fkv-fin.fkv.fin | 22.8 | 0.495 |
Tatoeba-test.fkv-liv.fkv.liv | 1.2 | 0.111 |
Tatoeba-test.fkv-vep.fkv.vep | 1.2 | 0.149 |
Tatoeba-test.hun-eng.hun.eng | 44.2 | 0.619 |
Tatoeba-test.hun-fin.hun.fin | 47.3 | 0.686 |
Tatoeba-test.izh-eng.izh.eng | 24.8 | 0.530 |
Tatoeba-test.izh-fin.izh.fin | 24.0 | 0.623 |
Tatoeba-test.izh-krl.izh.krl | 3.6 | 0.055 |
Tatoeba-test.kom-eng.kom.eng | 1.0 | 0.082 |
Tatoeba-test.krl-eng.krl.eng | 32.9 | 0.488 |
Tatoeba-test.krl-fin.krl.fin | 19.2 | 0.503 |
Tatoeba-test.krl-izh.krl.izh | 3.9 | 0.097 |
Tatoeba-test.liv-eng.liv.eng | 1.7 | 0.078 |
Tatoeba-test.liv-fkv.liv.fkv | 1.2 | 0.144 |
Tatoeba-test.liv-vep.liv.vep | 3.4 | 0.160 |
Tatoeba-test.mdf-eng.mdf.eng | 0.8 | 0.110 |
Tatoeba-test.multi.multi | 39.9 | 0.599 |
Tatoeba-test.myv-eng.myv.eng | 0.9 | 0.114 |
Tatoeba-test.sma-eng.sma.eng | 2.3 | 0.113 |
Tatoeba-test.sme-eng.sme.eng | 8.8 | 0.201 |
Tatoeba-test.udm-eng.udm.eng | 1.2 | 0.158 |
Tatoeba-test.vep-est.vep.est | 1.3 | 0.120 |
Tatoeba-test.vep-fkv.vep.fkv | 0.3 | 0.083 |
Tatoeba-test.vep-liv.vep.liv | 0.8 | 0.072 |
- dataset: opus
- model: transformer
- source language(s): est fin fkv hun izh krl liv vep vro
- target language(s): est fin fkv hun izh krl liv vep vro
- model: transformer
- pre-processing: normalization + SentencePiece (spm32k,spm32k)
- a sentence initial language token is required in the form of
>>id<<
(id = valid target language ID) - valid language labels: >>eng<< >>est<< >>fin<< >>hun<< >>sme<< >>mhr<< >>udm<< >>vro<< >>mrj<<
- download: opus-2021-02-23.zip
- test set translations: opus-2021-02-23.test.txt
- test set scores: opus-2021-02-23.eval.txt
testset | BLEU | chr-F | #sent | #words | BP |
---|---|---|---|---|---|
newsdev2015-enfi.eng-fin | 16.6 | 0.500 | 1500 | 23375 | 0.997 |
newsdev2015-enfi.fin-eng | 21.2 | 0.499 | 1500 | 32104 | 0.996 |
newsdev2018-enet.eng-est | 17.9 | 0.500 | 2000 | 34508 | 0.992 |
newsdev2018-enet.est-eng | 24.5 | 0.524 | 2000 | 43194 | 0.997 |
newssyscomb2009.eng-hun | 14.1 | 0.459 | 502 | 9733 | 1.000 |
newssyscomb2009.hun-eng | 18.5 | 0.476 | 502 | 11821 | 0.960 |
newstest2009.eng-hun | 14.1 | 0.448 | 2525 | 54965 | 0.997 |
newstest2009.hun-eng | 18.0 | 0.472 | 2525 | 65402 | 0.941 |
newstest2015-enfi.eng-fin | 18.2 | 0.513 | 1370 | 19968 | 1.000 |
newstest2015-enfi.fin-eng | 22.4 | 0.504 | 1370 | 27356 | 0.985 |
newstest2016-enfi.eng-fin | 19.1 | 0.523 | 3000 | 48116 | 0.976 |
newstest2016-enfi.fin-eng | 24.1 | 0.529 | 3000 | 63043 | 1.000 |
newstest2017-enfi.eng-fin | 21.7 | 0.545 | 3002 | 45718 | 0.983 |
newstest2017-enfi.fin-eng | 26.3 | 0.542 | 3002 | 61936 | 0.990 |
newstest2018-enet.eng-est | 18.6 | 0.511 | 2000 | 36236 | 0.989 |
newstest2018-enet.est-eng | 24.8 | 0.533 | 2000 | 45521 | 0.997 |
newstest2018-enfi.eng-fin | 14.3 | 0.481 | 3000 | 45475 | 1.000 |
newstest2018-enfi.fin-eng | 19.5 | 0.476 | 3000 | 62325 | 0.986 |
newstest2019-enfi.eng-fin | 18.6 | 0.502 | 1997 | 38369 | 0.944 |
newstest2019-fien.fin-eng | 23.6 | 0.517 | 1996 | 36227 | 0.990 |
newstestB2016-enfi.eng-fin | 15.5 | 0.493 | 3000 | 45766 | 1.000 |
newstestB2016-enfi.fin-eng | 20.1 | 0.490 | 3000 | 63043 | 0.990 |
newstestB2017-enfi.eng-fin | 17.8 | 0.511 | 3002 | 45506 | 0.988 |
newstestB2017-enfi.fin-eng | 22.6 | 0.511 | 3002 | 61936 | 1.000 |
newstestB2017-fien.fin-eng | 22.6 | 0.511 | 3002 | 61936 | 1.000 |
Tatoeba-test.est-est | 2.9 | 0.310 | 2 | 50 | 0.754 |
Tatoeba-test.est-fin | 51.8 | 0.703 | 189 | 966 | 0.932 |
Tatoeba-test.est-fkv | 1.2 | 0.180 | 4 | 80 | 1.000 |
Tatoeba-test.est-vep | 3.4 | 0.201 | 1 | 20 | 1.000 |
Tatoeba-test.est-vro | 2.2 | 0.272 | 1 | 24 | 0.549 |
Tatoeba-test.fin-est | 55.3 | 0.711 | 189 | 1051 | 1.000 |
Tatoeba-test.fin-fkv | 1.3 | 0.161 | 297 | 1717 | 1.000 |
Tatoeba-test.fin-hun | 44.0 | 0.670 | 1297 | 6471 | 0.999 |
Tatoeba-test.fin-izh | 6.8 | 0.296 | 3 | 13 | 1.000 |
Tatoeba-test.fin-krl | 3.4 | 0.203 | 29 | 151 | 1.000 |
Tatoeba-test.fkv-est | 10.1 | 0.365 | 4 | 80 | 1.000 |
Tatoeba-test.fkv-fin | 22.4 | 0.489 | 297 | 1664 | 0.990 |
Tatoeba-test.fkv-liv | 1.2 | 0.111 | 4 | 80 | 1.000 |
Tatoeba-test.fkv-vep | 1.2 | 0.149 | 4 | 80 | 1.000 |
Tatoeba-test.hun-fin | 47.1 | 0.684 | 1297 | 6499 | 0.930 |
Tatoeba-test.izh-fin | 24.0 | 0.623 | 3 | 12 | 1.000 |
Tatoeba-test.izh-krl | 3.6 | 0.055 | 3 | 12 | 1.000 |
Tatoeba-test.krl-fin | 18.6 | 0.498 | 29 | 153 | 0.967 |
Tatoeba-test.krl-izh | 3.9 | 0.097 | 3 | 12 | 1.000 |
Tatoeba-test.liv-fkv | 1.2 | 0.144 | 4 | 80 | 1.000 |
Tatoeba-test.liv-vep | 3.4 | 0.160 | 1 | 20 | 1.000 |
Tatoeba-test.multi-multi | 36.6 | 0.586 | 3670 | 19444 | 1.000 |
Tatoeba-test.vep-est | 1.3 | 0.120 | 1 | 20 | 0.156 |
Tatoeba-test.vep-fkv | 0.3 | 0.083 | 4 | 80 | 1.000 |
Tatoeba-test.vep-liv | 0.8 | 0.072 | 1 | 20 | 1.000 |
Tatoeba-test.vro-est | 5.0 | 0.341 | 1 | 26 | 0.920 |
- dataset: opus4m+btTCv20210807
- model: transformer
- source language(s): eng est fin fkv hun izh koi kom kpv krl liv mdf mhr mrj myv sma sme udm vep vot vro
- target language(s): eng est fin fkv hun izh koi kom kpv krl liv mdf mhr mrj myv sma sme udm vep vot vro
- model: transformer
- pre-processing: normalization + SentencePiece (spm32k,spm32k)
- a sentence initial language token is required in the form of
>>id<<
(id = valid target language ID) - valid language labels: >>eng<< >>est<< >>fin<< >>hun<< >>sme<< >>mhr<< >>udm<< >>vro<< >>mrj<<
- download: opus4m+btTCv20210807-2021-09-30.zip
- test set translations: opus4m+btTCv20210807-2021-09-30.test.txt
- test set scores: opus4m+btTCv20210807-2021-09-30.eval.txt
testset | BLEU | chr-F | #sent | #words | BP |
---|---|---|---|---|---|
newsdev2015-enfi.eng-fin | 16.9 | 0.505 | 1500 | 23375 | 1.000 |
newsdev2015-enfi.fin-eng | 21.2 | 0.500 | 1500 | 32104 | 0.977 |
newsdev2018-enet.eng-est | 18.3 | 0.504 | 2000 | 34508 | 0.996 |
newsdev2018-enet.est-eng | 23.6 | 0.520 | 2000 | 43194 | 0.981 |
newssyscomb2009.eng-hun | 13.8 | 0.456 | 502 | 9733 | 1.000 |
newssyscomb2009.hun-eng | 18.1 | 0.472 | 502 | 11821 | 0.950 |
newstest2009.eng-hun | 14.3 | 0.450 | 2525 | 54965 | 0.991 |
newstest2009.hun-eng | 18.0 | 0.470 | 2525 | 65402 | 0.938 |
newstest2015-enfi.eng-fin | 18.8 | 0.518 | 1370 | 19968 | 1.000 |
newstest2015-enfi.fin-eng | 22.1 | 0.504 | 1370 | 27356 | 0.965 |
newstest2016-enfi.eng-fin | 20.3 | 0.531 | 3000 | 48116 | 0.989 |
newstest2016-enfi.fin-eng | 24.2 | 0.528 | 3000 | 63043 | 0.991 |
newstest2017-enfi.eng-fin | 22.6 | 0.552 | 3002 | 45718 | 1.000 |
newstest2017-enfi.fin-eng | 26.1 | 0.540 | 3002 | 61936 | 0.972 |
newstest2018-enet.eng-est | 18.8 | 0.512 | 2000 | 36236 | 0.988 |
newstest2018-enet.est-eng | 24.3 | 0.531 | 2000 | 45521 | 0.981 |
newstest2018-enfi.eng-fin | 14.3 | 0.483 | 3000 | 45475 | 1.000 |
newstest2018-enfi.fin-eng | 19.5 | 0.477 | 3000 | 62325 | 0.965 |
newstest2019-enfi.eng-fin | 19.3 | 0.508 | 1997 | 38369 | 0.957 |
newstest2019-fien.fin-eng | 24.2 | 0.518 | 1996 | 36227 | 0.970 |
newstestB2016-enfi.eng-fin | 15.8 | 0.498 | 3000 | 45766 | 1.000 |
newstestB2016-enfi.fin-eng | 19.9 | 0.488 | 3000 | 63043 | 0.967 |
newstestB2017-enfi.eng-fin | 18.1 | 0.514 | 3002 | 45506 | 1.000 |
newstestB2017-enfi.fin-eng | 22.4 | 0.508 | 3002 | 61936 | 0.981 |
newstestB2017-fien.fin-eng | 22.4 | 0.508 | 3002 | 61936 | 0.981 |
Tatoeba-test-v2021-08-07.multi-multi | 36.7 | 0.565 | 10000 | 63872 | 0.971 |