-
-
Notifications
You must be signed in to change notification settings - Fork 11.4k
Pull requests: vllm-project/vllm
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[WIP] Support PCP (prefill context parallel) with MLA
needs-rebase
v1
#28988
opened Nov 19, 2025 by
FENP
Loading…
1 of 5 tasks
[Feat] Iteration-level profiling for Torch and CUDA profiler
nvidia
v1
#28987
opened Nov 19, 2025 by
benchislett
Loading…
[Rocm] Set VLLM_ROCM_USE_AITER_FUSION_SHARED_EXPERTS default is disabled
rocm
Related to AMD ROCm
#28985
opened Nov 19, 2025 by
zhyajie
Loading…
[ROCm][CI] Fix Weight Loading With Multiple GPU Tests on ROCm
ci/build
rocm
Related to AMD ROCm
#28984
opened Nov 19, 2025 by
micah-wil
Loading…
[cuda 13][aarch64][CI] Adding CI steps to build arm64 cuda13 nightly wheels and images
aarch64-cuda
ci/build
nvidia
#28983
opened Nov 19, 2025 by
wangshangsam
•
Draft
5 tasks
[ROCm][CI] Fixes tests for pytorch nightly and python only builds
ci/build
rocm
Related to AMD ROCm
#28979
opened Nov 19, 2025 by
AndreasKaratzas
Loading…
cleanup at::Tag::needs_fixed_stride_order
ready
ONLY add when PR is ready to merge/full CI is needed
#28974
opened Nov 19, 2025 by
BoyuanFeng
Loading…
[Feature] add session based streaming support to v1
tpu
Related to Google TPUs
v1
#28973
opened Nov 19, 2025 by
joshuadeng
•
Draft
2 of 5 tasks
[Bugfix] Fix precision loss in LoRA-wrapped RowParallelLinear by fusing bias into GEMM
#28972
opened Nov 19, 2025 by
prashanth058
Loading…
[LoRA] Support FusedMoE LoRA Triton kernel for mxfp4 model
gpt-oss
Related to GPT-OSS models
#28971
opened Nov 18, 2025 by
xyang16
Loading…
5 tasks
[DeepSeek] Fix DeepSeek V3.2 Rope Embedding
deepseek
Related to DeepSeek models
#28968
opened Nov 18, 2025 by
zyongye
Loading…
5 tasks
[Bug] Fix Batch Invariant MLA test
ready
ONLY add when PR is ready to merge/full CI is needed
v1
#28967
opened Nov 18, 2025 by
yewentao256
Loading…
[Model][QwenVL] Replace Related to Qwen models
torch.repeat_interleave with faster np.repeat
qwen
#28964
opened Nov 18, 2025 by
lgeiger
Loading…
[Model][QwenVL] Simplify cos/sin rotary embedding indexing
qwen
Related to Qwen models
ready
ONLY add when PR is ready to merge/full CI is needed
#28962
opened Nov 18, 2025 by
lgeiger
Loading…
[config] Expose ONLY add when PR is ready to merge/full CI is needed
get_total_num_hidden_layers() in ModelConfig
ready
#28961
opened Nov 18, 2025 by
ptovam
Loading…
[Bugfix] Fix typo in Qwen3 Next model executor
qwen
Related to Qwen models
ready
ONLY add when PR is ready to merge/full CI is needed
#28960
opened Nov 18, 2025 by
Nepherpitou
Loading…
2 of 5 tasks
[Bugfix] Use lazy string reference for DeepseekV3Config in config registry
deepseek
Related to DeepSeek models
#28958
opened Nov 18, 2025 by
yongming-qin
Loading…
Update Dockerfile to use gcc-toolset-14 and fix test case failures on power (ppc64le)
ci/build
v1
#28957
opened Nov 18, 2025 by
bhagyashrigai
•
Draft
5 tasks
Speed up macOS smoke test
ci/build
ready
ONLY add when PR is ready to merge/full CI is needed
#28954
opened Nov 18, 2025 by
mgoin
Loading…
5 tasks
Relax Transformers modeling backend MoE experts check
documentation
Improvements or additions to documentation
#28952
opened Nov 18, 2025 by
hmellor
Loading…
Previous Next
ProTip!
What’s not been updated in a month: updated:<2025-10-18.