Skip to content

Actions: huggingface/nanotron

Run FA2-related unit tests

Actions

Loading...
Loading

Show workflow options

Create status badge

Loading
586 workflow runs
586 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

[FP8 Training] End-to-end FP8 Training
Run FA2-related unit tests #585: Pull request #70 synchronize by xrsrke
July 12, 2024 09:35 3m 14s xrsrke/fp8-end-to-end
July 12, 2024 09:35 3m 14s
[FP8 Training] End-to-end FP8 Training
Run FA2-related unit tests #584: Pull request #70 synchronize by xrsrke
July 10, 2024 13:19 42s xrsrke/fp8-end-to-end
July 10, 2024 13:19 42s
[FP8 Training] End-to-end FP8 Training
Run FA2-related unit tests #583: Pull request #70 synchronize by xrsrke
July 10, 2024 10:39 35s xrsrke/fp8-end-to-end
July 10, 2024 10:39 35s
[Feature] Monitor model states during training
Run FA2-related unit tests #582: Pull request #183 synchronize by xrsrke
July 10, 2024 03:38 3m 15s xrsrke/monitor_nn
July 10, 2024 03:38 3m 15s
[FP8 Training] End-to-end FP8 Training
Run FA2-related unit tests #581: Pull request #70 synchronize by xrsrke
July 9, 2024 08:21 3m 16s xrsrke/fp8-end-to-end
July 9, 2024 08:21 3m 16s
Add layer-wise activation recomputation to llama model
Run FA2-related unit tests #580: Pull request #207 opened by C-TC
July 8, 2024 11:56 3m 19s C-TC:recompute
July 8, 2024 11:56 3m 19s
[FP8 Training] End-to-end FP8 Training
Run FA2-related unit tests #579: Pull request #70 synchronize by xrsrke
July 8, 2024 11:35 2m 53s xrsrke/fp8-end-to-end
July 8, 2024 11:35 2m 53s
[FP8 Training] End-to-end FP8 Training
Run FA2-related unit tests #578: Pull request #70 synchronize by xrsrke
July 8, 2024 08:45 2m 56s xrsrke/fp8-end-to-end
July 8, 2024 08:45 2m 56s
[FP8 Training] End-to-end FP8 Training
Run FA2-related unit tests #577: Pull request #70 synchronize by xrsrke
July 5, 2024 12:53 2m 56s xrsrke/fp8-end-to-end
July 5, 2024 12:53 2m 56s
[FP8 Training] End-to-end FP8 Training
Run FA2-related unit tests #576: Pull request #70 synchronize by xrsrke
July 5, 2024 12:36 2m 56s xrsrke/fp8-end-to-end
July 5, 2024 12:36 2m 56s
Move MoE Implementation into src/, add Load Balancing Losses
Run FA2-related unit tests #575: Pull request #192 synchronize by haeggee
July 3, 2024 13:45 3m 14s swiss-ai:moe
July 3, 2024 13:45 3m 14s
Llama3 conversion scripts 🦙
Run FA2-related unit tests #572: Pull request #174 synchronize by ischlag
July 2, 2024 15:04 3m 10s TJ-Solergibert:llama3_converter
July 2, 2024 15:04 3m 10s
Ring attention
Run FA2-related unit tests #571: Pull request #181 synchronize by zzhhjjj
July 2, 2024 14:32 3m 22s zzhhjjj:ring_attention
July 2, 2024 14:32 3m 22s
Ring attention
Run FA2-related unit tests #570: Pull request #181 synchronize by zzhhjjj
July 2, 2024 14:20 3m 22s zzhhjjj:ring_attention
July 2, 2024 14:20 3m 22s
Ring attention
Run FA2-related unit tests #568: Pull request #181 synchronize by zzhhjjj
July 2, 2024 13:33 3m 13s zzhhjjj:ring_attention
July 2, 2024 13:33 3m 13s
Ring attention
Run FA2-related unit tests #566: Pull request #181 synchronize by zzhhjjj
July 2, 2024 13:01 3m 22s zzhhjjj:ring_attention
July 2, 2024 13:01 3m 22s
Fix tp mem cache
Run FA2-related unit tests #565: Pull request #203 opened by AleHD
June 28, 2024 13:27 3m 10s AleHD:fix_tp_mem_cache
June 28, 2024 13:27 3m 10s
[FP8 Training] End-to-end FP8 Training
Run FA2-related unit tests #564: Pull request #70 synchronize by xrsrke
June 28, 2024 13:10 2m 58s xrsrke/fp8-end-to-end
June 28, 2024 13:10 2m 58s
[FP8 Training] End-to-end FP8 Training
Run FA2-related unit tests #563: Pull request #70 synchronize by xrsrke
June 28, 2024 12:52 2m 52s xrsrke/fp8-end-to-end
June 28, 2024 12:52 2m 52s
refacto generate + use simpler rotary for inference
Run FA2-related unit tests #562: Pull request #202 synchronize by 3outeille
June 25, 2024 12:03 2m 58s refacto-generate-3
June 25, 2024 12:03 2m 58s
refacto generate + use simpler rotary for inference
Run FA2-related unit tests #561: Pull request #202 synchronize by 3outeille
June 25, 2024 12:02 3m 13s refacto-generate-3
June 25, 2024 12:02 3m 13s
refacto generate + use simpler rotary for inference
Run FA2-related unit tests #560: Pull request #202 opened by 3outeille
June 25, 2024 11:54 3m 2s refacto-generate-3
June 25, 2024 11:54 3m 2s
[FP8 Training] End-to-end FP8 Training
Run FA2-related unit tests #559: Pull request #70 synchronize by xrsrke
June 18, 2024 10:55 3m 3s xrsrke/fp8-end-to-end
June 18, 2024 10:55 3m 3s
[FP8 Training] End-to-end FP8 Training
Run FA2-related unit tests #558: Pull request #70 synchronize by xrsrke
June 17, 2024 11:24 3m 12s xrsrke/fp8-end-to-end
June 17, 2024 11:24 3m 12s
[FP8 Training] End-to-end FP8 Training
Run FA2-related unit tests #557: Pull request #70 synchronize by xrsrke
June 15, 2024 08:15 3m 11s xrsrke/fp8-end-to-end
June 15, 2024 08:15 3m 11s