- dataset: opus
- model: transformer
- source language(s): pap
- target language(s): eng
- model: transformer
- pre-processing: normalization + SentencePiece (spm12k,spm12k)
- download: opus-2020-07-04.zip
- test set translations: opus-2020-07-04.test.txt
- test set scores: opus-2020-07-04.eval.txt
testset | BLEU | chr-F |
---|---|---|
Tatoeba-test.multi.eng | 48.1 | 0.595 |
Tatoeba-test.pap-eng.pap.eng | 48.1 | 0.595 |
- dataset: opus
- model: transformer
- source language(s): ind max_Latn min pap tmw_Latn zlm_Latn zsm_Latn
- target language(s): eng
- model: transformer
- pre-processing: normalization + SentencePiece (spm32k,spm32k)
- download: opus-2020-07-26.zip
- test set translations: opus-2020-07-26.test.txt
- test set scores: opus-2020-07-26.eval.txt
testset | BLEU | chr-F |
---|---|---|
Tatoeba-test.msa-eng.msa.eng | 38.3 | 0.570 |
Tatoeba-test.multi.eng | 38.4 | 0.570 |
Tatoeba-test.pap-eng.pap.eng | 52.9 | 0.598 |
- dataset: opus2m
- model: transformer
- source language(s): ind max_Latn min pap tmw_Latn zlm_Latn zsm_Latn
- target language(s): eng
- model: transformer
- pre-processing: normalization + SentencePiece (spm32k,spm32k)
- download: opus2m-2020-07-31.zip
- test set translations: opus2m-2020-07-31.test.txt
- test set scores: opus2m-2020-07-31.eval.txt
testset | BLEU | chr-F |
---|---|---|
Tatoeba-test.msa-eng.msa.eng | 39.6 | 0.580 |
Tatoeba-test.multi.eng | 39.7 | 0.580 |
Tatoeba-test.pap-eng.pap.eng | 49.1 | 0.579 |
- dataset: opus4m
- model: transformer
- source language(s): ind max_Latn min pap tmw_Latn zlm_Latn zsm_Latn
- target language(s): eng
- model: transformer
- pre-processing: normalization + SentencePiece (spm32k,spm32k)
- download: opus4m-2020-08-12.zip
- test set translations: opus4m-2020-08-12.test.txt
- test set scores: opus4m-2020-08-12.eval.txt
testset | BLEU | chr-F |
---|---|---|
Tatoeba-test.msa-eng.msa.eng | 40.0 | 0.585 |
Tatoeba-test.multi.eng | 40.1 | 0.585 |
Tatoeba-test.pap-eng.pap.eng | 57.2 | 0.625 |