Actions: ywang96/vllm

pre-commit

28 workflow runs

[Quant] BaiChuan SupportsQuant (#13710)
pre-commit #29: Commit d5ca211 pushed by ywang96
February 23, 2025 07:50 4m 49s main
[BugFix] Minor: logger import in attention backend (#13706)
pre-commit #28: Commit 322d2a2 pushed by ywang96
February 23, 2025 00:55 5m 54s main
[CI/Build] fix uv caching in Dockerfile (#13611)
pre-commit #27: Commit 78ac0f5 pushed by ywang96
February 22, 2025 22:25 5m 31s main
[Attention] MLA with chunked prefill (#12639)
pre-commit #26: Commit 288cc6c pushed by ywang96
February 22, 2025 02:43 3m 54s main
[Frontend] Add backend-specific options for guided decoding (#13505)
pre-commit #25: Commit bfbc0b3 pushed by ywang96
February 21, 2025 00:42 4m 48s main
[V1][Minor] Print KV cache size in token counts (#13596)
pre-commit #24: Commit d3ea501 pushed by ywang96
February 20, 2025 17:35 6m 1s main
[CI/Build] Use uv in the Dockerfile (#13566)
pre-commit #23: Commit 497bc83 pushed by ywang96
February 20, 2025 07:12 4m 55s main
[core] add sleep and wake up endpoint and v1 support (#12987)
pre-commit #22: Commit ba81163 pushed by ywang96
February 20, 2025 04:42 4m 40s main
[V1][Core] Generic mechanism for handling engine utility (#13060)
pre-commit #21: Commit caf7ff4 pushed by ywang96
February 19, 2025 09:12 4m 55s main
[Hardware][Gaudi][Feature] Support Contiguous Cache Fetch (#12139)
pre-commit #20: Commit d0a7a27 pushed by ywang96
February 19, 2025 04:03 4m 55s main
[Doc] [2/N] Add Fuyu E2E example for multimodal processor (#13331)
pre-commit #19: Commit 367cb8c pushed by ywang96
February 15, 2025 23:01 4m 45s main
[V1][Sampler] Don't apply temp for greedy-only (#13311)
pre-commit #18: Commit 6a854c7 pushed by ywang96
February 15, 2025 03:32 5m 34s main
[Misc] Bump the compressed-tensors version (#12736)
pre-commit #17: Commit 686006a pushed by ywang96
February 5, 2025 04:49 4m 43s main
[VLM] Merged multi-modal processor for InternVL-based models (#12553)
pre-commit #16: Commit d1ca7df pushed by ywang96
February 4, 2025 08:52 4m 48s main
Support Pixtral-Large HF by using llava multimodal_projector_bias con…
pre-commit #15: Commit 5d98d56 pushed by ywang96
February 4, 2025 05:47 4m 39s main
[Core] Improve hash collision avoidance in prefix caching (#12621)
pre-commit #14: Commit 73b35cc pushed by ywang96
February 4, 2025 03:19 4m 31s main
[Misc] Add SPDX-License-Identifier headers to python source files (#1…
pre-commit #12: Commit e489ad7 pushed by ywang96
February 2, 2025 22:24 5m 25s main
[Core][v1] Unify allocating slots in prefill and decode in KV cache m…
pre-commit #11: Commit f8ece6e pushed by ywang96
February 2, 2025 10:08 5m 23s main
Qwen2 5 vl new vit
pre-commit #10: Pull request #1 synchronize by yixqiao
February 2, 2025 07:25 4m 8s qwen2_5_vl_new_vit
Qwen2 5 vl new vit
pre-commit #9: Pull request #1 synchronize by yixqiao
February 2, 2025 07:22 4m 7s qwen2_5_vl_new_vit
Qwen2 5 vl new vit
pre-commit #8: Pull request #1 synchronize by yixqiao
February 2, 2025 02:05 4m 10s qwen2_5_vl_new_vit
Qwen2 5 vl new vit
pre-commit #7: Pull request #1 opened by yixqiao
February 1, 2025 10:30 4m 8s qwen2_5_vl_new_vit
[ROCm][AMD][Model] llama 3.2 support upstreaming (#12421)
pre-commit #6: Commit a1fc18c pushed by ywang96
January 31, 2025 06:30 4m 34s main
[V1][Metrics] Add GPU cache usage % gauge (#12561)
pre-commit #5: Commit f17f1d4 pushed by ywang96
January 30, 2025 06:04 5m 15s main
[Feature] [Spec decode]: Enable MLPSpeculator/Medusa and `prompt_logp…
pre-commit #4: Commit 6116ca8 pushed by ywang96
January 28, 2025 00:06 4m 33s main