Skip to content

Pull requests: vllm-project/llm-compressor

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Sort

Pull requests list

Update e2e/lm-eval test infrastructure ready When a PR is ready for review
#1323 opened Apr 3, 2025 by dbarbuzzi Loading…
bugfix kv cache quantization with ignored layers ready When a PR is ready for review
#1312 opened Apr 1, 2025 by brian-dellabetta Loading…
[NVFP4][WIP]: Add FP4 Support
#1309 opened Apr 1, 2025 by dsikka Draft
[Tracing] Allow torch.Sizes to be iterated
#1308 opened Apr 1, 2025 by kylesayrs Loading…
[Tracing] Better runtime error messages
#1307 opened Apr 1, 2025 by kylesayrs Loading…
Use align_module_device util
#1298 opened Mar 29, 2025 by kylesayrs Loading…
Update tests
#1297 opened Mar 28, 2025 by dsikka Draft
Reduce SmoothQuant Repr ready When a PR is ready for review
#1289 opened Mar 27, 2025 by kylesayrs Loading…
[BugFix] Multi-gpu temp bug fix ready When a PR is ready for review
#1286 opened Mar 26, 2025 by horheynm Draft
Smoothquant typehinting and onloading context ready When a PR is ready for review
#1285 opened Mar 26, 2025 by kylesayrs Loading…
Pipeline Extraction
#1279 opened Mar 24, 2025 by kylesayrs Draft
[Tests] Add mark skip for GPU ready When a PR is ready for review
#1264 opened Mar 18, 2025 by kylesayrs Loading…
[Performance] Sequential onloading ready When a PR is ready for review
#1263 opened Mar 18, 2025 by kylesayrs Loading…
fix lm eval test reproducbility issues
#1260 opened Mar 17, 2025 by brian-dellabetta Loading…
[Callbacks][Docs] Add docstrings to saving functions ready When a PR is ready for review
#1201 opened Feb 26, 2025 by kylesayrs Loading…
[Callbacks] Merge on_event with on_update, remove MagnitudePruningModifier.leave_enabled ready When a PR is ready for review
#1199 opened Feb 26, 2025 by kylesayrs Loading…
ProTip! Follow long discussions with comments:>50.