Actions: huggingface/open-r1

Showing runs from all workflows
363 workflow runs

[Testing Github workflow] Updating workflows and makefile
Tests #23: Pull request #214 synchronize by zeenolife
February 7, 2025 20:19 Action required zeenolife:almaz/test-workflow
Remove duplicate math-verify (#234)
Quality #374: Commit 3519a7f pushed by lewtun
February 7, 2025 19:01 2m 31s main
Remove duplicate math-verify
Quality #373: Pull request #234 opened by lewtun
February 7, 2025 18:58 3m 7s fix-setup
Remove puzzles (#233)
Quality #372: Commit 9c768d5 pushed by Rocketknight1
February 7, 2025 16:52 2m 22s main
Remove puzzles
Quality #371: Pull request #233 opened by Rocketknight1
February 7, 2025 16:49 2m 30s deprecate-puzzles
Refactor training configs and unify Slurm for training SFT & GRPO (#231)
Quality #370: Commit 0da0f7c pushed by lewtun
February 7, 2025 14:56 2m 16s main
Refactor training configs and unify Slurm for training SFT & GRPO
Quality #369: Pull request #231 synchronize by lewtun
February 7, 2025 14:37 2m 26s refactor-slurm
Use new GRPO logic
Quality #368: Pull request #232 synchronize by qgallouedec
February 7, 2025 14:30 2m 19s update-grpo-params
Refactor training configs and unify Slurm for training SFT & GRPO
Quality #367: Pull request #231 synchronize by lewtun
February 7, 2025 14:26 2m 18s refactor-slurm
Use new GRPO logic
Quality #366: Pull request #232 synchronize by qgallouedec
February 7, 2025 14:22 2m 28s update-grpo-params
Refactor training configs and unify Slurm for training SFT & GRPO
Quality #365: Pull request #231 synchronize by lewtun
February 7, 2025 14:22 45s refactor-slurm
Fix cosine_scaled_reward compatibility with GRPO (#229)
Quality #364: Commit dd915f8 pushed by qgallouedec
February 7, 2025 14:21 2m 20s main
Refactor training configs and unify Slurm for training SFT & GRPO
Quality #363: Pull request #231 synchronize by lewtun
February 7, 2025 14:18 4m 0s refactor-slurm
Refactor training configs and unify Slurm for training SFT & GRPO
Quality #362: Pull request #231 synchronize by lewtun
February 7, 2025 14:18 37s refactor-slurm
Use new GRPO logic
Quality #361: Pull request #232 opened by qgallouedec
February 7, 2025 14:17 2m 27s update-grpo-params
Fix cosine_scaled_reward compatibility with GRPO
Quality #360: Pull request #229 synchronize by qgallouedec
February 7, 2025 14:15 2m 27s fix-cosine_scaled_reward
Fix cosine_scaled_reward compatibility with GRPO
Quality #359: Pull request #229 synchronize by qgallouedec
February 7, 2025 14:02 2m 22s fix-cosine_scaled_reward
Fix cosine_scaled_reward compatibility with GRPO
Quality #358: Pull request #229 opened by qgallouedec
February 7, 2025 14:01 2m 15s fix-cosine_scaled_reward
fix config name (#222)
Quality #357: Commit dba152a pushed by qgallouedec
February 7, 2025 13:34 2m 15s main
Fix config name
Quality #355: Pull request #222 opened by qgallouedec
February 7, 2025 09:40 2m 22s fix-config-name
Weighted reward functions
Quality #354: Pull request #213 synchronize by zeenolife
February 7, 2025 08:50 2m 30s zeenolife:almaz/reward-weights
[GRPO] add cosine reward (#206)
Quality #353: Commit 250ab46 pushed by kashif
February 7, 2025 07:10 2m 25s main
[GRPO] add cosine reward
Quality #352: Pull request #206 synchronize by kashif
February 7, 2025 07:02 2m 28s cosine-rewards