Excluding multi30k_task2_test_2016 dataset from leaderboard for eng-deu/deu-eng language pair #2

Open
schniewmatz opened this issue Aug 24, 2023 · 0 comments

schniewmatz commented Aug 24, 2023

I am using the leaderboard to decide which model to choose for which language pair.
I find it a very good basis, as one obtains an average over a whole set of benchmarks and can, to some extent, judge how stably a model performs.
Going through the example outputs in detail, I nevertheless realized that the multi30k_task2_test_2016 dataset mostly contains pairs in which source and reference are almost unrelated, for example:

SOURCE: The man with pierced ears is wearing glasses and an orange hat.
REFERENCE: Der Mann trägt eine orange Wollmütze.

Here the pierced ears and the glasses are not present in the reference.

Or even worse:

SOURCE: Two men sitting on the roof of a house while another one stands on a ladder.
REFERENCE: Dachdecker bei der Arbeit.

Here the reference would be translated as "roofers at work".
This is similar for the other examples in the dataset.
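
As a rough sanity check, pairs like the two above could be flagged by comparing source and reference lengths. The following is only a minimal sketch: the function names, the whitespace tokenization, and the 0.6 threshold are my own assumptions and not part of the leaderboard tooling, and the third pair is a made-up control. A length ratio is a crude proxy, so it can only hint at missing content and cannot prove that a reference is a faithful translation.

```python
def length_ratio(source: str, reference: str) -> float:
    """Reference length divided by source length, in whitespace-separated tokens."""
    src_len = len(source.split())
    ref_len = len(reference.split())
    return ref_len / src_len if src_len else 0.0


def flag_suspicious(pairs, min_ratio=0.6):
    """Return the pairs whose reference is much shorter than the source.

    min_ratio is a hypothetical threshold chosen for illustration only.
    """
    return [(src, ref) for src, ref in pairs if length_ratio(src, ref) < min_ratio]


pairs = [
    # The two examples quoted in this issue.
    ("The man with pierced ears is wearing glasses and an orange hat.",
     "Der Mann trägt eine orange Wollmütze."),
    ("Two men sitting on the roof of a house while another one stands on a ladder.",
     "Dachdecker bei der Arbeit."),
    # Made-up control pair (not from the dataset) with a faithful reference.
    ("A dog runs.", "Ein Hund rennt."),
]

for src, ref in flag_suspicious(pairs):
    print(f"{length_ratio(src, ref):.2f}  {src}  ->  {ref}")
```

With this threshold, both quoted examples are flagged while the control pair is not.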

I do not know if this dataset has any other valid use case, but I don't find it useful to judge machine translation quality.
Could you remove it from the leaderboard?

@schniewmatz schniewmatz changed the title Excluding multi30k_task2_test_2016 dataset from leaderboard fro eng-deu/eng-deu language pair Excluding multi30k_task2_test_2016 dataset from leaderboard fro eng-deu/deu-eng language pair Aug 24, 2023