NQ's mined hard negatives file hn.json contains more queries (70076) than the original NQ train set (58880)? #113

Open
x-zb opened this issue Mar 30, 2024 · 0 comments

x-zb commented Mar 30, 2024

Hi :),

For NQ, your self-mined hard negatives training file hn.json appears to contain 70076 queries, but the original training set downloaded from DPR (biencoder-nq-train.json) contains only 58880. Could you explain where the additional queries come from?
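
For anyone who wants to reproduce the comparison, here is a minimal sketch of how the two files could be checked against each other. It assumes that both files are JSON lists of records and that each record stores its query under a "question" key. That matches the DPR training file format but is only a guess for hn.json. The helper names `load_questions` and `compare_query_sets` are made up for illustration.

```python
import json
from collections import Counter


def load_questions(path, key="question"):
    """Return the query strings from a JSON file holding a list of records.

    Assumption: each record is a dict with the query under `key`.
    """
    with open(path, encoding="utf-8") as f:
        records = json.load(f)
    return [rec[key] for rec in records]


def compare_query_sets(mined, original):
    """Summarize how a mined query list differs from the original one."""
    mined_counts = Counter(mined)
    original_set = set(original)
    return {
        "mined_total": len(mined),
        "mined_unique": len(mined_counts),
        "original_total": len(original),
        "original_unique": len(original_set),
        # Queries present in the mined file but absent from the original set.
        "only_in_mined": sorted(q for q in mined_counts if q not in original_set),
        # Queries that occur more than once in the mined file.
        "duplicated_in_mined": sorted(q for q, n in mined_counts.items() if n > 1),
    }
```

Calling `compare_query_sets(load_questions("hn.json"), load_questions("biencoder-nq-train.json"))` would show whether the difference comes from repeated queries or from queries that are not in the DPR file at all.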

Thanks in advance.
