opus-2020-10-04.zip dataset: opus model: transformer source language(s): cjy_Hans cjy_Hant cmn cmn_Hans cmn_Hant eng gan hak hak_Hani hsn_Hani lzh lzh_Hans nan wuu yue_Hans yue_Hant target language(s): cjy_Hans cjy_Hant cmn cmn_Hans cmn_Hant eng gan hak hak_Hani hsn_Hani lzh lzh_Hans nan wuu yue_Hans yue_Hant model: transformer pre-processing: normalization + SentencePiece (spm32k,spm32k) a sentence initial language token is required in the form of >>id<< (id = valid target language ID) download: opus-2020-10-04.zip test set translations: opus-2020-10-04.test.txt test set scores: opus-2020-10-04.eval.txt Benchmarks testset BLEU chr-F Tatoeba-test.eng-zho.eng.zho 27.9 0.234 Tatoeba-test.multi.multi 28.8 0.433 Tatoeba-test.zho-eng.zho.eng 30.1 0.498 Tatoeba-test.zho-zho.zho.zho 14.1 0.102