-
-
Notifications
You must be signed in to change notification settings - Fork 5.5k
Pull requests: vllm-project/vllm
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[ROCm][Kernel] Using the correct warp_size value
ready
ONLY add when PR is ready to merge/full CI is needed
#12789
opened Feb 5, 2025 by
gshtras
Loading…
Update chat_utils.py to avoid issues when tool call is present but None
frontend
#12788
opened Feb 5, 2025 by
jgreer013
Loading…
[core][V1] pipeline parallel with threads
needs-rebase
v1
#12787
opened Feb 5, 2025 by
ruisearch42
•
Draft
[VLM] Update compatibility with transformers 4.49
documentation
Improvements or additions to documentation
ready
ONLY add when PR is ready to merge/full CI is needed
#12781
opened Feb 5, 2025 by
DarkLight1337
Loading…
[Hardware][Intel-Gaudi] Multi-step scheduling implementation for HPU
#12779
opened Feb 5, 2025 by
tzielinski-habana
Loading…
[Kernel] Make rotary_embedding ops more flexible with input shape
ready
ONLY add when PR is ready to merge/full CI is needed
#12777
opened Feb 5, 2025 by
Isotr0py
Loading…
2 tasks done
Use ONLY add when PR is ready to merge/full CI is needed
RMSNorm
in TransformersModel
ready
#12776
opened Feb 5, 2025 by
hmellor
Loading…
[Model][Speculative Decoding] DeepSeek MTP spec decode
speculative-decoding
#12755
opened Feb 4, 2025 by
luccafong
Loading…
[Frontend] Adding the "User Defined Custom Tool Calling" parser for the Llama models
frontend
#12752
opened Feb 4, 2025 by
lulmer
Loading…
[Bugfix] Env var to to disable xgrammar any_whitespace
needs-rebase
structured-output
#12744
opened Feb 4, 2025 by
wallashss
Loading…
[Build] Do not add cmake/ninja dependencies when they are installed
ci/build
#12739
opened Feb 4, 2025 by
mgorny
Loading…
Merge similar examples in Improvements or additions to documentation
offline_inference
into single basic
example
documentation
#12737
opened Feb 4, 2025 by
hmellor
Loading…
[Bugfix] Fix disagg hang caused by the prefill and decode communication issues
#12723
opened Feb 4, 2025 by
houseroad
Loading…
[core] V1 pipeline parallel async execution loop
needs-rebase
v1
#12720
opened Feb 4, 2025 by
ruisearch42
•
Draft
[Model] Add support for partial rotary embeddings in Phi3 model
#12718
opened Feb 4, 2025 by
garg-amit
Loading…
Previous Next
ProTip!
Adding no:label will show everything without a label.