-
Notifications
You must be signed in to change notification settings - Fork 2k
Pull requests: huggingface/open-r1
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Extend max_model_length to prevent context truncation
#463
opened Mar 3, 2025 by
eldarkurtic
Loading…
feat: make reward functions to support parallel computation
#398
opened Feb 23, 2025 by
0x404
Loading…
New GRPO dataset and tasks: formally-verified program correctness
#379
opened Feb 20, 2025 by
ocramz
Loading…
Fix: Default value of
cosine_min_value_wrong
parameter
#305
opened Feb 13, 2025 by
zhangsheng377
Loading…
Simplified installation requirements to support more accelerators
#303
opened Feb 13, 2025 by
ji-huazhong
Loading…
[GRPO] generate with prompt containing the first <think> tag
#283
opened Feb 11, 2025 by
kashif
Loading…
Previous Next
ProTip!
Filter pull requests by the default branch with base:main.