-
-
Notifications
You must be signed in to change notification settings - Fork 9.2k
Pull requests: vllm-project/vllm
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[WIP][Model] Add Ernie4.5 VL Model Support
documentation
Improvements or additions to documentation
new-model
Requests to new models
#22514
opened Aug 8, 2025 by
CSWYF3634076
Loading…
[gpt-oss] Small bug fixes for frontend
frontend
needs-rebase
v1
#22512
opened Aug 8, 2025 by
heheda12345
Loading…
3 of 4 tasks
Fix Llama4 FlashInfer FP4 MoE issues
llama
Related to Llama models
#22511
opened Aug 8, 2025 by
nvpohanh
Loading…
3 of 4 tasks
[oss] Init gpt-oss bf16 support
deepseek
Related to DeepSeek models
performance
Performance-related issues
#22508
opened Aug 8, 2025 by
jeejeelee
Loading…
2 of 6 tasks
[XPU] Fix OOM issue for data parallel with Ray backend
v1
#22500
opened Aug 8, 2025 by
faaany
Loading…
3 of 4 tasks
[Misc] Further refine type annotations in parallel state
ready
ONLY add when PR is ready to merge/full CI is needed
#22499
opened Aug 8, 2025 by
DarkLight1337
Loading…
1 of 4 tasks
[Debugging] Add annotation for easier trace analysis
v1
#22496
opened Aug 8, 2025 by
dayeol
Loading…
[CI] Add end-to-end V1 min_tokens test coverage
v1
#22495
opened Aug 8, 2025 by
arjunbreddy22
Loading…
3 of 4 tasks
consistency between the test and final Docker image
ci/build
rocm
Related to AMD ROCm
#22490
opened Aug 8, 2025 by
pramenku
Loading…
Feat/sliding window metrics — Related to #22480
v1
#22488
opened Aug 8, 2025 by
NumberWan
Loading…
4 of 5 tasks
[Structured Output] Make the output of structured output example more complete
documentation
Improvements or additions to documentation
structured-output
#22481
opened Aug 8, 2025 by
shen-shanshan
Loading…
1 of 4 tasks
[WIP][Attention] FA3 Attention Sinks Perf Boost
ci/build
#22478
opened Aug 8, 2025 by
LucasWilkinson
•
Draft
4 tasks
[Refactor] Refactor FP8 & INT8 Quant Folder inside
8bit
ci/build
#22474
opened Aug 7, 2025 by
yewentao256
Loading…
[V1][P/D]Bug fix: handle edge case where KVConnectorOutput is None
v1
#22473
opened Aug 7, 2025 by
liuzijing2014
Loading…
[Docs] Rename “Distributed inference and serving” to “Parallelism & Scaling”
documentation
Improvements or additions to documentation
#22466
opened Aug 7, 2025 by
crypdick
Loading…
[Feature] add procese set cpu affinity current gpu device
v1
#22461
opened Aug 7, 2025 by
lengrongfu
Loading…
1 of 4 tasks
[V1][Metrics][Plugin] Add plugin support for custom Improvements or additions to documentation
v1
StatLoggerBase
implementations
ci/build
documentation
Previous Next
ProTip!
Find all pull requests that aren't related to any open issues with -linked:issue.