-
Notifications
You must be signed in to change notification settings - Fork 109
Pull requests: vllm-project/llm-compressor
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[Sequential] Support models with nested
_no_split_modules
#1329
opened Apr 6, 2025 by
kylesayrs
Loading…
fix: Make Recipe.model_dump() output compatible with model_validate()
#1328
opened Apr 6, 2025 by
ved1beta
Loading…
docs: fix missing git clone command and repo name typos in DEVELOPING.md
#1325
opened Apr 4, 2025 by
gattshjott
Loading…
fix(logger): normalize log_file_level input for consistency
#1324
opened Apr 4, 2025 by
gattshjott
Loading…
Update e2e/lm-eval test infrastructure
ready
When a PR is ready for review
#1323
opened Apr 3, 2025 by
dbarbuzzi
Loading…
bugfix kv cache quantization with ignored layers
ready
When a PR is ready for review
#1312
opened Apr 1, 2025 by
brian-dellabetta
Loading…
[Tracing] Remove
TraceableWhisperForConditionalGeneration
#1310
opened Apr 1, 2025 by
kylesayrs
Loading…
Reduce SmoothQuant Repr
ready
When a PR is ready for review
#1289
opened Mar 27, 2025 by
kylesayrs
Loading…
Smoothquant typehinting and onloading context
ready
When a PR is ready for review
#1285
opened Mar 26, 2025 by
kylesayrs
Loading…
[Tests] Add mark skip for GPU
ready
When a PR is ready for review
#1264
opened Mar 18, 2025 by
kylesayrs
Loading…
[Performance] Sequential onloading
ready
When a PR is ready for review
#1263
opened Mar 18, 2025 by
kylesayrs
Loading…
[Quantization] Channel-wise Output Activation Quantization for Attention QKV Modules + KV-cache channel quantization
ready
When a PR is ready for review
[Callbacks][Docs] Add docstrings to saving functions
ready
When a PR is ready for review
#1201
opened Feb 26, 2025 by
kylesayrs
Loading…
[Callbacks] Merge When a PR is ready for review
on_event
with on_update
, remove MagnitudePruningModifier.leave_enabled
ready
#1199
opened Feb 26, 2025 by
kylesayrs
Loading…
Previous Next
ProTip!
Follow long discussions with comments:>50.