Folders and files Name Name Last commit message
Last commit date
parent directory
View all files
dataset: opus
model: transformer
source language(s): eng
target language(s): pob por
model: transformer
pre-processing: normalization + SentencePiece (spm32k,spm32k)
a sentence initial language token is required in the form of >>id<<
(id = valid target language ID)
valid language labels: >>por<< >>pob<<
download: opus-2021-02-18.zip
test set translations: opus-2021-02-18.test.txt
test set scores: opus-2021-02-18.eval.txt
testset
BLEU
chr-F
#sent
#words
BP
Tatoeba-test.eng-por
43.9
0.652
10000
75371
0.969
dataset: opus+bt
model: transformer-align
source language(s): eng
target language(s): pob por
model: transformer-align
pre-processing: normalization + SentencePiece (spm32k,spm32k)
a sentence initial language token is required in the form of >>id<<
(id = valid target language ID)
valid language labels: >>por<< >>pob<<
download: opus+bt-2021-04-14.zip
test set translations: opus+bt-2021-04-14.test.txt
test set scores: opus+bt-2021-04-14.eval.txt
testset
BLEU
chr-F
#sent
#words
BP
Tatoeba-test.eng-por
43.8
0.651
10000
75371
0.972
tico19-test.eng-pob
48.2
0.725
2100
62729
0.943
tico19-test.eng-por
48.0
0.725
2100
62729
0.965
opusTCv20210807+bt_transformer-big_2022-03-13.zip
testset
BLEU
chr-F
#sent
#words
BP
Tatoeba-test-v2021-08-07.eng-multi
49.7
0.69324
10000
79644
0.978
tico19-test.eng-pob
50.0
0.73132
2100
62729
0.950
tico19-test.eng-por
50.2
0.73121
2100
62729
0.954
You can’t perform that action at this time.