-
Notifications
You must be signed in to change notification settings - Fork 352
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Examples where >1 targets? #813
Comments
Hey @Muennighoff ! Unless I'm misremembering, this is only changed on the |
@Muennighoff, GEM/web_nlg and Yes, the reasoning is because in NLG, having one reference is quite often unreliable, so a lot of test sets are designed with multiple references where multi-ref metrics should be used. Multi-ref metric (for bleu, rouge, sari) support was also added to the Bigscience EH because of this. I know other NLG datasets that are intended for multi-ref test sets are E2E and Totto, but not sure if they were implemented. For promptsource |
As you changed the signature of
apply()
to return a list for targets instead of a string, can you point me to some datasets that use multiple targets?Is randomly picking one the best to get it back to a single string?
The text was updated successfully, but these errors were encountered: