Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

difference between the two models #36

Open
Cooperx521 opened this issue Dec 4, 2024 · 1 comment
Open

difference between the two models #36

Cooperx521 opened this issue Dec 4, 2024 · 1 comment

Comments

@Cooperx521
Copy link

Hi, thanks for your great work!
image

I'm curious about the results in the distractor_3, what is the difference between Qwen2-7B-Instruct-224K and Qwen2-7B-Instruct-extend-step_1000

Best regards :)

@jzhang38
Copy link
Collaborator

jzhang38 commented Dec 5, 2024

I believe Qwen2-7B-Instruct-224K is generated by you.

Qwen2-7B-Instruct-extend-step_100 was there in the repo.
Screenshot 2024-12-05 at 9 30 43 AM

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants