Skip to content

Actions: mlc-ai/mlc-llm

Build Docs

Actions

Loading...
Loading

Show workflow options

Create status badge

Loading
630 workflow runs
630 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

[Fix] Commit prefix cache update (#2761)
Build Docs #480: Commit 72f2d50 pushed by MasterJH5574
August 6, 2024 13:30 6m 57s main
August 6, 2024 13:30 6m 57s
[Engine] Initial support of pipeline parallelism (#2757)
Build Docs #479: Commit cfb6f58 pushed by MasterJH5574
August 6, 2024 04:36 7m 3s main
August 6, 2024 04:36 7m 3s
[BugFix][Config] Fix up-scaling sequence length (#2759)
Build Docs #478: Commit 1d19866 pushed by MasterJH5574
August 6, 2024 04:00 6m 42s main
August 6, 2024 04:00 6m 42s
[Android] Update Andorid APK (#2760)
Build Docs #477: Commit 6e95204 pushed by MasterJH5574
August 6, 2024 03:59 6m 32s main
August 6, 2024 03:59 6m 32s
[Engine] Reducing MLC engine CPU overhead (#2756)
Build Docs #476: Commit 3efd4de pushed by tqchen
August 5, 2024 20:20 6m 24s main
August 5, 2024 20:20 6m 24s
[Fix] Fix config parsing for rnn models like RWKV (#2750)
Build Docs #475: Commit e3877eb pushed by MasterJH5574
August 5, 2024 13:44 6m 37s main
August 5, 2024 13:44 6m 37s
[Model] Add support for Aya-23 8B Model by Cohere (#2603)
Build Docs #474: Commit c357be9 pushed by MasterJH5574
August 5, 2024 13:39 6m 6s main
August 5, 2024 13:39 6m 6s
[Docs] Changing CLI and RestAPI's Multi-GPU instructions from Note to…
Build Docs #473: Commit 8a08f14 pushed by tqchen
August 5, 2024 12:39 7m 5s main
August 5, 2024 12:39 7m 5s
[Bench] Support tensorrt-llm API backend (#2747)
Build Docs #472: Commit 0af7caf pushed by tqchen
August 5, 2024 12:39 7m 39s main
August 5, 2024 12:39 7m 39s
[Tool] Prelim support for function calling with Llama3.1 and Hermes2 …
Build Docs #471: Commit 03a50c2 pushed by tqchen
August 4, 2024 11:47 6m 5s main
August 4, 2024 11:47 6m 5s
[Bench] Support benchmarking for fixed request rates (#2736)
Build Docs #470: Commit fc9f389 pushed by tqchen
August 3, 2024 23:59 6m 24s main
August 3, 2024 23:59 6m 24s
[DOCS] Update convert_weights.rst (#2741)
Build Docs #469: Commit 45e7232 pushed by tqchen
August 3, 2024 15:42 6m 11s main
August 3, 2024 15:42 6m 11s
[Bench] Allow running cuda-profile on existing mlc endpoint (#2737)
Build Docs #468: Commit 01b5f68 pushed by tqchen
August 3, 2024 08:30 6m 25s main
August 3, 2024 08:30 6m 25s
[Serving] Add prefill-mode to cli option (#2735)
Build Docs #467: Commit 28a1016 pushed by MasterJH5574
August 3, 2024 03:05 8m 54s main
August 3, 2024 03:05 8m 54s
[Fix] Fix casting token data error (#2734)
Build Docs #466: Commit baf8111 pushed by tqchen
August 2, 2024 21:47 6m 52s main
August 2, 2024 21:47 6m 52s
[Bench] Adopting multi-processing to send requests (#2727)
Build Docs #465: Commit 6aff661 pushed by MasterJH5574
August 2, 2024 14:21 6m 39s main
August 2, 2024 14:21 6m 39s
[Android] Add NDK version requirement in docs to avoid build package …
Build Docs #464: Commit 4c62c8a pushed by MasterJH5574
August 2, 2024 14:01 6m 33s main
August 2, 2024 14:01 6m 33s
[Bench] Enable cuda profile (#2725)
Build Docs #463: Commit 08206d1 pushed by tqchen
August 2, 2024 11:39 6m 31s main
August 2, 2024 11:39 6m 31s
[Bench] Remove index column in csv output (#2728)
Build Docs #462: Commit 9befc0d pushed by tqchen
August 2, 2024 11:38 6m 11s main
August 2, 2024 11:38 6m 11s
[C++] Handle system_prefix_token_ids in C++ Conv template (#2723)
Build Docs #461: Commit 68cd794 pushed by tqchen
August 1, 2024 16:58 6m 57s main
August 1, 2024 16:58 6m 57s
[ConvTemplate] Update Gemma template with <bos> (#2722)
Build Docs #460: Commit 709f484 pushed by MasterJH5574
August 1, 2024 15:29 8m 14s main
August 1, 2024 15:29 8m 14s
[Bench] LLMPerf dataset (#2713)
Build Docs #459: Commit b0f2731 pushed by MasterJH5574
August 1, 2024 15:27 6m 15s main
August 1, 2024 15:27 6m 15s
Default bundle gemma2 (#2721)
Build Docs #458: Commit 97bbf52 pushed by tqchen
August 1, 2024 15:27 8m 15s main
August 1, 2024 15:27 8m 15s
[iOS] Add Gemma2 for iOS app (#2717)
Build Docs #457: Commit 59cf662 pushed by tqchen
August 1, 2024 12:40 8m 46s main
August 1, 2024 12:40 8m 46s
[Android] Update model for Andorid APK (#2718)
Build Docs #456: Commit 7296565 pushed by tqchen
August 1, 2024 12:40 6m 57s main
August 1, 2024 12:40 6m 57s