Actions: InternLM/lmdeploy

pr_ete_test

2,273 workflow runs

pr_ete_test
pr_ete_test #48: Manually run by zhulinJulia24
March 5, 2024 02:48 31m 34s parallelize_testcase
reduce torchengine prefill mem usage
pr_ete_test #47: Pull request #1240 synchronize by grimoire
March 5, 2024 02:30 36m 30s grimoire:reduce-prefill-mem
reduce torchengine prefill mem usage
pr_ete_test #46: Pull request #1240 opened by grimoire
March 4, 2024 13:46 37m 54s grimoire:reduce-prefill-mem
fix bf16 multinomial sampling
pr_ete_test #45: Pull request #1239 opened by grimoire
March 4, 2024 11:06 38m 6s grimoire:fix-sampling
Refactor turbomind attention
pr_ete_test #44: Pull request #1116 synchronize by lzhangzz
March 4, 2024 10:51 1h 10m 46s lzhangzz:tm-attn
Hide qos functions from swagger UI if not applied
pr_ete_test #43: Pull request #1238 opened by AllentDan
March 4, 2024 08:58 2h 1m 39s AllentDan:hide-qos
remove unused kernel in pytorch engine
pr_ete_test #42: Pull request #1237 opened by grimoire
March 4, 2024 07:32 1h 22m 33s grimoire:remove-unused-kernel
Refactor turbomind attention
pr_ete_test #41: Pull request #1116 synchronize by lzhangzz
March 4, 2024 07:23 1h 21m 48s lzhangzz:tm-attn
Refactor turbomind attention
pr_ete_test #40: Pull request #1116 synchronize by lzhangzz
March 4, 2024 07:18 36m 10s lzhangzz:tm-attn
bump version to v0.2.5
pr_ete_test #39: Pull request #1235 synchronize by lvhan028
March 4, 2024 06:57 1h 49m 5s bump-version
Refactor turbomind attention
pr_ete_test #38: Pull request #1116 synchronize by lzhangzz
March 4, 2024 06:21 1h 22m 9s lzhangzz:tm-attn
Refactor turbomind attention
pr_ete_test #37: Pull request #1116 synchronize by lzhangzz
March 4, 2024 06:17 1h 36m 28s lzhangzz:tm-attn
Refactor chat template and support accurate name matching.
pr_ete_test #36: Pull request #1216 synchronize by AllentDan
March 4, 2024 06:08 1h 47m 33s AllentDan:refactor-model
fix pytest version
pr_ete_test #33: Pull request #1236 opened by zhulinJulia24
March 4, 2024 04:03 43m 58s fix_pytest_version
pr_ete_test
pr_ete_test #32: Manually run by zhulinJulia24
March 4, 2024 03:41 39m 23s fix_pytest_version
bump version to v0.2.5
pr_ete_test #31: Pull request #1235 opened by lvhan028
March 4, 2024 03:40 3m 21s bump-version
fix returning logits in prefill phase of pytorch engine
pr_ete_test #30: Pull request #1209 synchronize by grimoire
March 4, 2024 03:26 15m 53s grimoire:fix-decode
[WIP] support Medusa
pr_ete_test #26: Pull request #1231 opened by zhyncs
March 3, 2024 04:58 37m 12s zhyncs:medusa-plugin
fix multinomial sampling
pr_ete_test #25: Pull request #1228 synchronize by grimoire
March 3, 2024 03:18 34m 24s grimoire:fix-multinomial-sampling
Fix None session_len
pr_ete_test #24: Pull request #1230 opened by lvhan028
March 2, 2024 11:37 36m 0s lvhan028:fix-none-session-len