-
-
Notifications
You must be signed in to change notification settings - Fork 7.3k
Pull requests: vllm-project/vllm
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[CI/Build] Fix TPU V1 Test mixed use of & and && across tests
ci/build
#17968
opened May 11, 2025 by
CAROLZXYZXY
Loading…
FIX: import TypedDict from typing_extensions for python<3.12
frontend
#17967
opened May 11, 2025 by
juncgu
Loading…
[misc] add instructions on how to install nvshmem/pplx/deepep
#17964
opened May 11, 2025 by
youkaichao
Loading…
[Doc] Update pip install instruction for testing dependencies
documentation
Improvements or additions to documentation
#17963
opened May 11, 2025 by
ztang2370
Loading…
[Bugfix] Fix pydantic.errors.PydanticUserError
frontend
#17962
opened May 11, 2025 by
Potabk
Loading…
[FEAT] [ROCm] [V1]: Add AITER biased group topk for DeepSeekV3
#17955
opened May 11, 2025 by
vllmellm
Loading…
Refactor
ci/build
needs-rebase
tpu
Related to Google TPUs
v1
#17950
opened May 10, 2025 by
yarongmu-google
•
Draft
[v1] Support multiple KV cache groups in GPU model runner
ready
ONLY add when PR is ready to merge/full CI is needed
tpu
Related to Google TPUs
v1
#17945
opened May 10, 2025 by
heheda12345
Loading…
[doc] list the hf downloaded models
documentation
Improvements or additions to documentation
#17940
opened May 10, 2025 by
reidliu41
Loading…
[BugFix] Correct max_model_len derivation from config.json for Mistral format
#17937
opened May 10, 2025 by
princepride
Loading…
[Bugfix] Avoid repeatedly creating dummy data during engine startup
multi-modality
Related to multi-modality (#4194)
ready
ONLY add when PR is ready to merge/full CI is needed
v1
#17935
opened May 10, 2025 by
DarkLight1337
Loading…
[kernel] integrate permute/unpermute kernel into deepgemm moe
#17934
opened May 10, 2025 by
CalebDu
Loading…
[Misc][RFC] Add automated profiling sweep and heatmap visualization tools
#17933
opened May 10, 2025 by
ConstBob
Loading…
[WIP] automatically bind CPU OMP Threads of a rank to CPU ids of a NUMA node.
ci/build
#17930
opened May 10, 2025 by
louie-tsai
Loading…
[Frontend] [Core] Add Tensorizer support for LoRA adapter serialization and deserialization
documentation
Improvements or additions to documentation
#17926
opened May 9, 2025 by
sangstar
Loading…
TESTING CI test completion - no need to merge.
ci/build
needs-rebase
#17921
opened May 9, 2025 by
Alexei-V-Ivanov-AMD
Loading…
[Hardware][Intel-Gaudi] enable text embedding for Intel-Gaudi backend
#17920
opened May 9, 2025 by
libinta
Loading…
Previous Next
ProTip!
Updated in the last three days: updated:>2025-05-08.