Skip to content

Latest commit

 

History

History

eng-ron

opus-2020-08-27.zip

  • dataset: opus
  • model: transformer
  • source language(s): eng
  • target language(s): mol ron
  • model: transformer
  • pre-processing: normalization + SentencePiece (spm32k,spm32k)
  • a sentence initial language token is required in the form of >>id<< (id = valid target language ID)
  • download: opus-2020-08-27.zip
  • test set translations: opus-2020-08-27.test.txt
  • test set scores: opus-2020-08-27.eval.txt

Benchmarks

testset BLEU chr-F
newsdev2016-enro-engron.eng.ron 30.7 0.591
newstest2016-enro-engron.eng.ron 28.4 0.573
Tatoeba-test.eng.ron 45.0 0.665

opus-2021-02-23.zip

  • dataset: opus
  • model: transformer
  • source language(s): eng
  • target language(s): mol ron
  • model: transformer
  • pre-processing: normalization + SentencePiece (spm32k,spm32k)
  • a sentence initial language token is required in the form of >>id<< (id = valid target language ID)
  • valid language labels:
  • download: opus-2021-02-23.zip
  • test set translations: opus-2021-02-23.test.txt
  • test set scores: opus-2021-02-23.eval.txt

Benchmarks

testset BLEU chr-F #sent #words BP
newsdev2016-enro.eng-ron 30.7 0.592 1999 51566 1.000
newstest2016-enro.eng-ron 28.4 0.573 1999 49094 1.000
Tatoeba-test.eng-ron 45.0 0.666 5000 36851 0.990

opus+bt-2021-03-07.zip

  • dataset: opus+bt
  • model: transformer-align
  • source language(s): eng
  • target language(s): mol ron
  • model: transformer-align
  • pre-processing: normalization + SentencePiece (spm32k,spm32k)
  • a sentence initial language token is required in the form of >>id<< (id = valid target language ID)
  • valid language labels:
  • download: opus+bt-2021-03-07.zip
  • test set translations: opus+bt-2021-03-07.test.txt
  • test set scores: opus+bt-2021-03-07.eval.txt

Benchmarks

testset BLEU chr-F #sent #words BP
newsdev2016-enro.eng-ron 33.5 0.610 1999 51566 0.984
newstest2016-enro.eng-ron 31.7 0.591 1999 49094 0.998
Tatoeba-test.eng-ron 46.9 0.678 5000 36851 0.983

opusTCv20210807+bt_transformer-big_2022-02-25.zip

Benchmarks

testset BLEU chr-F #sent #words BP
newsdev2016-enro.eng-ron 36.9 0.62849 1999 51566 0.990
newstest2016-enro.eng-ron 34.3 0.60802 1999 49094 1.000
Tatoeba-test-v2021-08-07.eng-multi 48.6 0.68634 5508 40367 0.984