Folders and files Name Name Last commit message
Last commit date
parent directory
View all files
dataset: opus1m
model: transformer
source language(s): ast cat fra gcf glg ind ita jak lad min mol msa oci pob por ron spa zlm zsm
target language(s): bel orv rue rus ukr
model: transformer
pre-processing: normalization + SentencePiece (spm32k,spm32k)
a sentence initial language token is required in the form of >>id<<
(id = valid target language ID)
valid language labels: >>rus<< >>ukr<< >>bel<< >>bel_Latn<<
download: opus1m-2021-02-17.zip
test set translations: opus1m-2021-02-17.test.txt
test set scores: opus1m-2021-02-17.eval.txt
testset
BLEU
chr-F
#sent
#words
BP
Tatoeba-test.multi-multi
42.8
0.625
10000
59677
0.991
dataset: opus1m
model: transformer
source language(s): ast cat fra gcf glg ind ita jak lad min mol msa oci pob por ron spa zlm zsm
target language(s): bel orv rue rus ukr
model: transformer
pre-processing: normalization + SentencePiece (spm32k,spm32k)
a sentence initial language token is required in the form of >>id<<
(id = valid target language ID)
valid language labels: >>rus<< >>ukr<< >>bel<< >>bel_Latn<<
download: opus1m-2021-02-18.zip
test set translations: opus1m-2021-02-18.test.txt
test set scores: opus1m-2021-02-18.eval.txt
testset
BLEU
chr-F
#sent
#words
BP
newstest2012.fra-rus
19.9
0.473
3003
64830
1.000
newstest2012.spa-rus
20.5
0.487
3003
64830
1.000
newstest2013.fra-rus
21.3
0.485
3000
58560
0.999
newstest2013.spa-rus
22.7
0.500
3000
58560
1.000
Tatoeba-test.ast-rus
35.9
0.682
1
6
1.000
Tatoeba-test.cat-rus
43.0
0.626
185
1226
0.985
Tatoeba-test.cat-ukr
32.1
0.545
455
2438
0.966
Tatoeba-test.fra-bel
14.7
0.366
283
1703
1.000
Tatoeba-test.fra-orv
0.5
0.166
37
217
1.000
Tatoeba-test.fra-rus
44.7
0.645
10000
60792
0.996
Tatoeba-test.fra-ukr
37.2
0.580
10000
54082
0.989
Tatoeba-test.gcf-rus
0.0
0.237
1
3
1.000
Tatoeba-test.glg-rus
46.4
0.614
37
220
0.986
Tatoeba-test.ita-bel
17.5
0.410
264
1513
0.995
Tatoeba-test.ita-orv
2.1
0.155
8
41
1.000
Tatoeba-test.ita-rus
43.8
0.633
10000
65658
0.986
Tatoeba-test.ita-ukr
43.2
0.639
5000
25353
0.995
Tatoeba-test.lad-bel
1.3
0.266
2
12
1.000
Tatoeba-test.lad-rus
7.5
0.361
18
83
1.000
Tatoeba-test.lad-ukr
1.9
0.270
20
98
1.000
Tatoeba-test.msa-bel
5.2
0.287
3
39
1.000
Tatoeba-test.msa-rus
32.3
0.522
88
594
1.000
Tatoeba-test.msa-ukr
10.3
0.361
9
77
0.905
Tatoeba-test.multi-multi
42.8
0.625
10000
59677
0.991
Tatoeba-test.oci-rus
15.9
0.358
84
531
1.000
Tatoeba-test.por-bel
18.5
0.682
3
16
0.936
Tatoeba-test.por-rus
43.8
0.641
10000
65326
0.998
Tatoeba-test.por-ukr
38.3
0.603
3372
18933
0.991
Tatoeba-test.ron-rus
50.3
0.679
782
4487
0.973
Tatoeba-test.spa-bel
16.1
0.398
205
1259
1.000
Tatoeba-test.spa-orv
1.0
0.151
33
142
1.000
Tatoeba-test.spa-rue
2.3
0.181
97
319
1.000
Tatoeba-test.spa-rus
46.2
0.654
10000
65817
0.986
Tatoeba-test.spa-ukr
38.6
0.596
10000
53833
0.988
You can’t perform that action at this time.