Pull requests: vllm-project/vllm
[Core] Add Additional Metrics to vLLM Server
needs-rebase
v1
#12726 opened Feb 4, 2025 by sahelib25
Implements dual-chunk-flash-attn backend for dual chunk attention with sparse attention support
ci/build
needs-rebase
ready
#11844 opened Jan 8, 2025 by sighingnow
[V1] V1 engine implements parallel sampling (AsyncLLM and LLMEngine)
ci/build
documentation
frontend
v1
#10980 opened Dec 7, 2024 by afeldman-nm
[Doc] Add offline distributed continuous batching example
#10053 opened Nov 5, 2024 by dakies
chore(outputs): make return class into dataclass
unstale
#3017 opened Feb 24, 2024 by aarnphm