Pull requests: vllm-project/vllm

Pull requests list

[WIP][Model] Add Ernie4.5 VL Model Support
Labels: documentation, new-model
#22514 opened Aug 8, 2025 by CSWYF3634076
Fix Llama4 FlashInfer FP4 MoE issues
Labels: llama
#22511 opened Aug 8, 2025 by nvpohanh
3 of 4 tasks
[Platform] Custom ops update
#22509 opened Aug 8, 2025 by wangxiyuan
3 of 4 tasks
[oss] Init gpt-oss bf16 support
Labels: deepseek, performance
#22508 opened Aug 8, 2025 by jeejeelee
2 of 6 tasks
[XPU] Fix OOM issue for data parallel with Ray backend
Labels: v1
#22500 opened Aug 8, 2025 by faaany
3 of 4 tasks
[Misc] Further refine type annotations in parallel state
Labels: ready
#22499 opened Aug 8, 2025 by DarkLight1337
1 of 4 tasks
[CI] Add end-to-end V1 min_tokens test coverage
Labels: v1
#22495 opened Aug 8, 2025 by arjunbreddy22
3 of 4 tasks
consistency between the test and final Docker image
Labels: ci/build, rocm
#22490 opened Aug 8, 2025 by pramenku
Feat/sliding window metrics — Related to #22480
Labels: v1
#22488 opened Aug 8, 2025 by NumberWan
4 of 5 tasks
[Structured Output] Make the output of structured output example more complete
Labels: documentation, structured-output
#22481 opened Aug 8, 2025 by shen-shanshan
1 of 4 tasks
vllm fix check on max vocab size
Labels: v1
#22471 opened Aug 7, 2025 by xw285cornell
[Docs] Rename “Distributed inference and serving” to “Parallelism & Scaling”
Labels: documentation
#22466 opened Aug 7, 2025 by crypdick
Fix loading of quantized BigCode models
#22463 opened Aug 7, 2025 by eldarkurtic
[Feature] add procese set cpu affinity current gpu device
Labels: v1
#22461 opened Aug 7, 2025 by lengrongfu
1 of 4 tasks
Fix nvfp4 swizzling
#22450 opened Aug 7, 2025 by yiliu30
4 tasks
[Bugfix] Added more env vars to hash
#22449 opened Aug 7, 2025 by nvjullin
4 tasks
support silu+nvfp4 quant fusion
Labels: ci/build
#22448 opened Aug 7, 2025 by stickingjh