Pull requests: vllm-project/vllm

Pull requests list

[Model] Remove redundant None check in DeepSeekOCR image input processing
Labels: deepseek
#32016 opened Jan 9, 2026 by maang-h
[Fix] Qwen3-VL-MoE bitsandbytes 4 bit quant
Labels: qwen
#32013 opened Jan 9, 2026 by Datta0
1 of 5 tasks
[fix] add cutedsl to global sf
Labels: nvidia
#32001 opened Jan 9, 2026 by jiahanc
5 tasks
Fix type error
Labels: fb-exported, frontend, meta-exported, ready
#31999 opened Jan 8, 2026 by Adolfo-Karim
[Misc] Enable async scheduling by default with spec decoding
Labels: ready
#31998 opened Jan 8, 2026 by njhill
[CI/Build][Hardware][AMD] Fix test_forward_error
Labels: rocm, v1
#31997 opened Jan 8, 2026 by rjrock Draft
3 tasks done
[ROCM] Add ROCm image build to release pipeline
Labels: ci/build, rocm
#31995 opened Jan 8, 2026 by dllehr-amd
5 tasks
fix lora moe sharding when rank < max_lora_rank
Labels: gpt-oss, ready
#31994 opened Jan 8, 2026 by gnovack
[Bugfix] Fix Fp8 Triton for non-gated MoE (Nemotron)
#31983 opened Jan 8, 2026 by danisereb
5 tasks
Add mergify label job for "bug" match
Labels: ci/build
#31980 opened Jan 8, 2026 by mgoin
5 tasks
[Model] Reorganize pooling layers
Labels: ci/build, documentation, qwen, ready, v1
#31973 opened Jan 8, 2026 by DarkLight1337
5 tasks
[Models]: Make Multimodal config implicit in ViT implementation
Labels: qwen
#31972 opened Jan 8, 2026 by Isotr0py Draft
5 tasks
[CPU] Add head sizes 80 and 112 with vec16 fallback
Labels: cpu, v1
#31968 opened Jan 8, 2026 by R3hankhan123
5 tasks