-
Notifications
You must be signed in to change notification settings - Fork 2k
Issues: huggingface/open-r1
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
Managed resource "pipeline" is not used within its block and is closed before it's returned
#501
opened Mar 11, 2025 by
keskival
Missing reference per token probability in advatage calculation
#497
opened Mar 10, 2025 by
davidluciolu
System memory usage sudden increase at a particularly step, which leads to memory overflow
#496
opened Mar 10, 2025 by
HuiyuanYan
5 tasks done
[ROCm] GRPO Training with vLLM is hanging on MI300X system, w/o vLLM it works properly
#482
opened Mar 5, 2025 by
nikhil-tensorwave
Why don't rewards increase instead of staying at a certain value in GRPO?
#474
opened Mar 5, 2025 by
AXy1527
how to set the max_model_length, max_new_tokens and generation_size when evaluate ?
#472
opened Mar 5, 2025 by
ItGirls
Is it normal for a 1.5B model on an H100 80G to require several hundred hours for LiveCodeBench?
#466
opened Mar 4, 2025 by
wccccp
Previous Next
ProTip!
Find all open issues with in progress development work with linked:pr.