Skip to content

Actions: huggingface/nanotron

All workflows

Actions

Loading...
Loading

Showing runs from all workflows
1,839 workflow runs
1,839 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

refactor NanotronParameter to support fp8
Secret Leaks #80: Commit c7d9e8a pushed by xrsrke
November 20, 2024 16:30 20s xrsrke/fp8_for_nanotron
November 20, 2024 16:30 20s
Small fixes when resuming training
Run FA2-related unit tests #689: Pull request #245 synchronize by NouamaneTazi
November 20, 2024 08:59 8m 9s nouamane/fix-optim-states-resuming
November 20, 2024 08:59 8m 9s
Small fixes when resuming training
Code Quality #592: Pull request #245 synchronize by NouamaneTazi
November 20, 2024 08:59 19s nouamane/fix-optim-states-resuming
November 20, 2024 08:59 19s
Small fixes when resuming training
Run non-FA2-related unit tests #689: Pull request #245 synchronize by NouamaneTazi
November 20, 2024 08:59 48m 37s nouamane/fix-optim-states-resuming
November 20, 2024 08:59 48m 37s
.
Secret Leaks #78: Commit e967e78 pushed by NouamaneTazi
November 20, 2024 08:59 18s nouamane/fix-optim-states-resuming
November 20, 2024 08:59 18s
Small fixes when resuming training
Run FA2-related unit tests #688: Pull request #245 opened by NouamaneTazi
November 19, 2024 14:49 7m 19s nouamane/fix-optim-states-resuming
November 19, 2024 14:49 7m 19s
Small fixes when resuming training
Run non-FA2-related unit tests #688: Pull request #245 opened by NouamaneTazi
November 19, 2024 14:49 16m 6s nouamane/fix-optim-states-resuming
November 19, 2024 14:49 16m 6s
backup before doing monkey dispatch fp8 tp
Secret Leaks #74: Commit 9510f57 pushed by xrsrke
November 18, 2024 14:38 16s xrsrke/fp8_for_nanotron
November 18, 2024 14:38 16s
remove transpose in kernel
Secret Leaks #73: Commit e93cf55 pushed by xrsrke
November 4, 2024 10:57 16s xrsrke/fp8_for_nanotron
November 4, 2024 10:57 16s
add dumb transpose in fp8_matmul_kernel
Secret Leaks #72: Commit edb1e87 pushed by xrsrke
November 3, 2024 14:48 19s xrsrke/fp8_for_nanotron
November 3, 2024 14:48 19s
65% speed up in fwd+bwd pass with m=n=k=32768
Secret Leaks #71: Commit 4b26cf1 pushed by xrsrke
November 3, 2024 13:33 16s xrsrke/fp8_for_nanotron
November 3, 2024 13:33 16s
Load random states from checkpoint
Run non-FA2-related unit tests #684: Pull request #238 opened by gritukan
November 2, 2024 17:54 Action required thenno:fix_random_state_load
November 2, 2024 17:54 Action required
Load random states from checkpoint
Run FA2-related unit tests #684: Pull request #238 opened by gritukan
November 2, 2024 17:54 Action required thenno:fix_random_state_load
November 2, 2024 17:54 Action required
Load random states from checkpoint
Code Quality #587: Pull request #238 opened by gritukan
November 2, 2024 17:54 Action required thenno:fix_random_state_load
November 2, 2024 17:54 Action required
add speed benchmark
Secret Leaks #70: Commit f3e3495 pushed by xrsrke
November 1, 2024 18:50 17s xrsrke/fp8_for_nanotron
November 1, 2024 18:50 17s
add bencmark speed with 5% speed up
Secret Leaks #69: Commit c937375 pushed by xrsrke
November 1, 2024 15:34 22s xrsrke/fp8_for_nanotron
November 1, 2024 15:34 22s
remove uncessary .contiguous() in fp8 backward
Secret Leaks #67: Commit 39a4960 pushed by xrsrke
November 1, 2024 13:50 17s xrsrke/fp8_for_nanotron
November 1, 2024 13:50 17s
update profiling script
Secret Leaks #66: Commit b4156dc pushed by xrsrke
November 1, 2024 13:45 17s xrsrke/fp8_for_nanotron
November 1, 2024 13:45 17s
add fp8 tp profiler
Secret Leaks #65: Commit dda00a4 pushed by xrsrke
October 30, 2024 14:38 18s xrsrke/fp8_for_nanotron
October 30, 2024 14:38 18s