Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Questions about reproducing the WISE model on the Temporal dataset. #446

Open
Eliaukyh opened this issue Dec 13, 2024 · 0 comments
Open

Comments

@Eliaukyh
Copy link

Hello!

Q1: I noticed that the code doesn't include an OOD Generalization evaluation metric, which is preventing me from reproducing the results on the temporal task. Could you update the code?

Q2: I’ve run several tests with the WISE model on LLaMA3-8B, but the results haven’t been very good. Both the rewriter_acc and rephrase_acc are around 50. From the perspective of scaling laws, shouldn't a larger model lead to better performance? What steps should I take to improve WISE's performance on LLaMA3?

I look forward to your response, and I apologize for any inconvenience caused. Thank you for your understanding!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant