Skip to content

Actions: mlc-ai/mlc-llm

Build Docs

Actions

Loading...
Loading

Show workflow options

Create status badge

Loading
596 workflow runs
596 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

[C++] Skip getting global multi-gpu function when disco not enabled (…
Build Docs #496: Commit a25c796 pushed by MasterJH5574
August 14, 2024 15:37 10m 11s main
August 14, 2024 15:37 10m 11s
[Model] Support Minicpm (#2755)
Build Docs #495: Commit 266957f pushed by MasterJH5574
August 14, 2024 02:55 6m 22s main
August 14, 2024 02:55 6m 22s
[Bench] Update warmup phase to warm up more batch sizes (#2803)
Build Docs #494: Commit 4c9d6da pushed by MasterJH5574
August 14, 2024 02:54 6m 22s main
August 14, 2024 02:54 6m 22s
[Fix] Fix prefix cache (#2798)
Build Docs #493: Commit c7951ab pushed by tqchen
August 13, 2024 23:54 6m 28s main
August 13, 2024 23:54 6m 28s
[Android] Update compileSDK to 35 (#2800)
Build Docs #492: Commit b882f9a pushed by mengshyu
August 13, 2024 16:48 7m 39s main
August 13, 2024 16:48 7m 39s
[Preset] Add snowflake-arctic-embed to preset (#2795)
Build Docs #491: Commit aab513a pushed by CharlieFRuan
August 12, 2024 15:36 6m 39s main
August 12, 2024 15:36 6m 39s
[Grammar] Add SetStopTokenIds for tool calling (#2767)
Build Docs #490: Commit fcf055a pushed by CharlieFRuan
August 10, 2024 17:24 6m 18s main
August 10, 2024 17:24 6m 18s
[Bench] Improve benchmark compatibility (#2794)
Build Docs #489: Commit 37860b0 pushed by tqchen
August 10, 2024 16:24 6m 43s main
August 10, 2024 16:24 6m 43s
[Windows] Windows build target definitions for adreno (#2785)
Build Docs #488: Commit d497790 pushed by tqchen
August 9, 2024 13:34 8m 0s main
August 9, 2024 13:34 8m 0s
[CLI] Add CPU check device option (#2786)
Build Docs #487: Commit 63d7163 pushed by tqchen
August 9, 2024 11:14 7m 23s main
August 9, 2024 11:14 7m 23s
[Bench] Rename json schema dataset (#2781)
Build Docs #486: Commit 2b770e5 pushed by tqchen
August 9, 2024 11:13 6m 16s main
August 9, 2024 11:13 6m 16s
[Fix] Fix qwen2 chat template (#2784)
Build Docs #485: Commit 09978ec pushed by tqchen
August 9, 2024 11:13 6m 30s main
August 9, 2024 11:13 6m 30s
[Bench] Json schema dataset (#2775)
Build Docs #484: Commit 36055f0 pushed by MasterJH5574
August 8, 2024 18:11 7m 31s main
August 8, 2024 18:11 7m 31s
[CI] Enable long file paths for Windows CI (#2779)
Build Docs #483: Commit 603b3ec pushed by MasterJH5574
August 8, 2024 15:25 7m 4s main
August 8, 2024 15:25 7m 4s
[FIX] Add a positional argument for tree_attn (#2764)
Build Docs #482: Commit 58564d8 pushed by tqchen
August 8, 2024 14:09 8m 18s main
August 8, 2024 14:09 8m 18s
[Config] Fix model limits (#2765)
Build Docs #481: Commit 8099747 pushed by vinx13
August 7, 2024 16:54 8m 50s main
August 7, 2024 16:54 8m 50s
[Fix] Commit prefix cache update (#2761)
Build Docs #480: Commit 72f2d50 pushed by MasterJH5574
August 6, 2024 13:30 6m 57s main
August 6, 2024 13:30 6m 57s
[Engine] Initial support of pipeline parallelism (#2757)
Build Docs #479: Commit cfb6f58 pushed by MasterJH5574
August 6, 2024 04:36 7m 3s main
August 6, 2024 04:36 7m 3s
[BugFix][Config] Fix up-scaling sequence length (#2759)
Build Docs #478: Commit 1d19866 pushed by MasterJH5574
August 6, 2024 04:00 6m 42s main
August 6, 2024 04:00 6m 42s
[Android] Update Andorid APK (#2760)
Build Docs #477: Commit 6e95204 pushed by MasterJH5574
August 6, 2024 03:59 6m 32s main
August 6, 2024 03:59 6m 32s
[Engine] Reducing MLC engine CPU overhead (#2756)
Build Docs #476: Commit 3efd4de pushed by tqchen
August 5, 2024 20:20 6m 24s main
August 5, 2024 20:20 6m 24s
[Fix] Fix config parsing for rnn models like RWKV (#2750)
Build Docs #475: Commit e3877eb pushed by MasterJH5574
August 5, 2024 13:44 6m 37s main
August 5, 2024 13:44 6m 37s
[Model] Add support for Aya-23 8B Model by Cohere (#2603)
Build Docs #474: Commit c357be9 pushed by MasterJH5574
August 5, 2024 13:39 6m 6s main
August 5, 2024 13:39 6m 6s
[Docs] Changing CLI and RestAPI's Multi-GPU instructions from Note to…
Build Docs #473: Commit 8a08f14 pushed by tqchen
August 5, 2024 12:39 7m 5s main
August 5, 2024 12:39 7m 5s
[Bench] Support tensorrt-llm API backend (#2747)
Build Docs #472: Commit 0af7caf pushed by tqchen
August 5, 2024 12:39 7m 39s main
August 5, 2024 12:39 7m 39s