Actions: huggingface/open-r1

Showing runs from all workflows
364 workflow runs

[GRPO] add cosine reward
Quality #324: Pull request #206 synchronize by kashif
February 6, 2025 13:42 2m 22s cosine-rewards
[GRPO] add cosine reward
Quality #323: Pull request #206 synchronize by kashif
February 6, 2025 13:33 2m 26s cosine-rewards
Add GPQA Diamond and fix evaluation deps
Quality #322: Pull request #196 synchronize by lewtun
February 6, 2025 13:28 2m 31s lewtun/add-gpqa-cmd
Add GPQA Diamond and fix evaluation deps
Quality #321: Pull request #196 synchronize by lewtun
February 6, 2025 13:22 3m 5s lewtun/add-gpqa-cmd
Add GPQA Diamond and fix evaluation deps
Quality #320: Pull request #196 synchronize by lewtun
February 6, 2025 13:15 2m 19s lewtun/add-gpqa-cmd
Add GPQA Diamond and fix evaluation deps
Quality #319: Pull request #196 synchronize by lewtun
February 6, 2025 13:07 2m 18s lewtun/add-gpqa-cmd
[GRPO] add cosine reward
Quality #318: Pull request #206 opened by kashif
February 6, 2025 12:50 2m 35s cosine-rewards
Add GPQA Diamond and fix evaluation deps
Quality #317: Pull request #196 synchronize by lewtun
February 6, 2025 12:49 2m 18s lewtun/add-gpqa-cmd
Update sft.py (#201)
Quality #316: Commit f8cbb98 pushed by lewtun
February 6, 2025 12:33 4m 14s main
Add GPQA Diamond and fix evaluation deps
Quality #315: Pull request #196 synchronize by lewtun
February 6, 2025 11:05 2m 15s lewtun/add-gpqa-cmd
Provide a minimal reproducible experiment using GRPO for mathematical…
Quality #314: Commit 571661a pushed by edbeeching
February 6, 2025 10:43 2m 25s main
Add GPQA Diamond and fix evaluation deps
Quality #313: Pull request #196 synchronize by lewtun
February 6, 2025 10:15 2m 25s lewtun/add-gpqa-cmd
Add GPQA Diamond and fix evaluation deps
Quality #312: Pull request #196 synchronize by lewtun
February 6, 2025 09:53 2m 18s lewtun/add-gpqa-cmd
Add GPQA Diamond and fix evaluation deps
Quality #311: Pull request #196 synchronize by lewtun
February 6, 2025 08:46 2m 16s lewtun/add-gpqa-cmd
Optimize the code
Quality #309: Pull request #203 opened by HendricksJudy
February 6, 2025 08:24 Action required HendricksJudy:optimize-code
Add GPQA Diamond and fix evaluation deps
Quality #308: Pull request #196 synchronize by lewtun
February 6, 2025 08:17 2m 22s lewtun/add-gpqa-cmd
Update: Fix eval crash by disabling vLLM when using DeepSpeed
Quality #307: Pull request #147 synchronize by ATaylorAerospace
February 6, 2025 08:03 Action required ATaylorAerospace:main
SFT Trainer Model Card Creation Fix
Quality #306: Pull request #201 opened by westonbrown
February 6, 2025 05:43 2m 20s westonbrown:patch-1
fix: easier environment setup; pin trl, transformers
Quality #305: Pull request #199 synchronize by ctjlewis
February 6, 2025 00:47 Action required ctjlewis:easier-setup
fix: easier environment setup; pin trl, transformers
Quality #304: Pull request #199 synchronize by ctjlewis
February 6, 2025 00:44 Action required ctjlewis:easier-setup
fix: easier environment setup; pin trl, transformers
Quality #303: Pull request #199 synchronize by ctjlewis
February 6, 2025 00:43 Action required ctjlewis:easier-setup
fix: easier environment setup; pin trl, transformers
Quality #302: Pull request #199 opened by ctjlewis
February 6, 2025 00:37 Action required ctjlewis:easier-setup
Update grpo.py (#171)
Quality #301: Commit 736b59f pushed by lewtun
February 5, 2025 23:25 2m 20s main
Add GPQA Diamond and fix evaluation deps
Quality #300: Pull request #196 synchronize by lewtun
February 5, 2025 22:59 3m 6s lewtun/add-gpqa-cmd