Skip to content

Commit

Permalink
Updated example to use <s> instead of [SEP]
Browse files Browse the repository at this point in the history
  • Loading branch information
sweta20 committed May 6, 2024
1 parent 3a59006 commit 680f1b5
Showing 1 changed file with 5 additions and 3 deletions.
8 changes: 5 additions & 3 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -80,11 +80,13 @@ comet-score -d wmt22:en-de -t PATH/TO/TRANSLATIONS

Scoring with context:
```bash
echo -e "Pies made from apples like these. [SEP] Oh, they do look delicious.\nOh, they do look delicious." >> src.txt
echo -e "Des tartes faites avec des pommes comme celles-ci. [SEP] Elles ont l’air delicieux.\nElles ont l’air delicieux" >> hyp1.txt
echo -e "Des tartes faites avec des pommes comme celles-ci. [SEP] Ils ont l’air delicieux.\nIls ont l’air delicieux." >> hyp2.txt
echo -e "Pies made from apples like these. </s> Oh, they do look delicious.\nOh, they do look delicious." >> src.txt
echo -e "Des tartes faites avec des pommes comme celles-ci. </s> Elles ont l’air delicieux.\nElles ont l’air delicieux" >> hyp1.txt
echo -e "Des tartes faites avec des pommes comme celles-ci. </s> Ils ont l’air delicieux.\nIls ont l’air delicieux." >> hyp2.txt
```

where `</s>` is the separator token of the specific tokenizer (here: `xlm-roberta-large`) that the underlying model uses.

```bash
comet-score -s src.txt -t hyp1.txt hyp2.txt --model Unbabel/wmt20-comet-qe-da --enable-context
```
Expand Down

0 comments on commit 680f1b5

Please sign in to comment.