# opus-2020-10-04.zip

* dataset: opus
* model: transformer
* source language(s): eng kat
* target language(s): eng kat
* pre-processing: normalization + SentencePiece (spm32k,spm32k)
* a sentence-initial language token is required in the form of >>id<< (id = valid target language ID)
* download: opus-2020-10-04.zip
* test set translations: opus-2020-10-04.test.txt
* test set scores: opus-2020-10-04.eval.txt

## Benchmarks

| testset | BLEU | chr-F |
|---------|------|-------|
| Tatoeba-test.eng-kat.eng.kat | 4.6 | 0.163 |
| Tatoeba-test.kat-eng.kat.eng | 40.3 | 0.564 |
| Tatoeba-test.multi.multi | 25.4 | 0.365 |

# opus4m+btTCv20210807-2021-09-30.zip

* dataset: opus4m+btTCv20210807
* model: transformer
* source language(s): eng kat xmf
* target language(s): eng kat xmf
* pre-processing: normalization + SentencePiece (spm32k,spm32k)
* a sentence-initial language token is required in the form of >>id<< (id = valid target language ID)
* valid language labels: >>eng<< >>kat<<
* download: opus4m+btTCv20210807-2021-09-30.zip
* test set translations: opus4m+btTCv20210807-2021-09-30.test.txt
* test set scores: opus4m+btTCv20210807-2021-09-30.eval.txt

## Benchmarks

| testset | BLEU | chr-F | #sent | #words | BP |
|---------|------|-------|-------|--------|----|
| Tatoeba-test-v2021-08-07.multi-multi | 29.7 | 0.452 | 2061 | 10985 | 1.000 |
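
Both releases require a sentence-initial language token of the form >>id<<. The sketch below shows one way to add it in Python before the text is passed to the model. It assumes the token is prepended to the raw source sentence and separated from it by a single space. The helper name `add_language_token` and the `VALID_LABELS` set are hypothetical, introduced only for illustration; the labels are the ones listed for the 2021-09-30 release.

```python
# Hypothetical helper, not part of the released models.
# Assumption: labels are taken from the "valid language labels" entry
# of the opus4m+btTCv20210807-2021-09-30 release.
VALID_LABELS = {"eng", "kat"}


def add_language_token(sentence: str, target_lang: str) -> str:
    """Prepend the >>id<< target-language token to a source sentence."""
    if target_lang not in VALID_LABELS:
        raise ValueError(
            f"unknown target language {target_lang!r}; "
            f"expected one of {sorted(VALID_LABELS)}"
        )
    # Assumption: a single space separates the token from the sentence.
    return f">>{target_lang}<< {sentence.strip()}"


if __name__ == "__main__":
    # Mark an English sentence for translation into Georgian (kat).
    print(add_language_token("How are you?", "kat"))
```

Validating the label up front is a deliberate choice in this sketch: a token the model was not trained with would presumably not be rejected by the model itself, so an early error is easier to notice than a poor translation.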