Pull requests: vllm-project/vllm-ascend
[Bugfix] rename enable_flash_comm_v1 back to enable_sp
module:core
module:ops
#6883 opened Feb 28, 2026 by realliujiaxu
2 tasks
[bugs] fix pass bug: pass really rope dim for npu_rotary_embedding
#6880 opened Feb 28, 2026 by aipaes
[Feat][310p] 310P support w8a8s quantization and saving w8a8sc state
module:tests
#6878 opened Feb 28, 2026 by pu-zhe
[EPLB] Display the expert hotness comparison before and after eplb.
#6877 opened Feb 28, 2026 by shenchuxiaofugui
[BugFix][PCP] Fix precision bugs for pcp/dcp in PD disaggregate
#6876 opened Feb 28, 2026 by YzTongNiar
Revert "[Feature][Quant] Auto-detect quantization format from model f…
module:core
module:quantization
module:tests
#6873 opened Feb 28, 2026 by Potabk
[doc] Update GLM4.x.md, add GLM4.x multi-node deploy tutorial
documentation
Improvements or additions to documentation
#6872 opened Feb 28, 2026 by s-zk
Fix RoPE shape mismatch for mtp models with flashcomm v1 enabled
module:ops
#6870 opened Feb 28, 2026 by Zhujiyang2
[KV Pool][Feature] Add support for Yuanrong backend.
documentation
Improvements or additions to documentation
#6869 opened Feb 28, 2026 by yangsonglin13
[v0.13.0][CI] Upgrade to CANN 8.5.1
ready
read for review
ready-for-test
start test by label for PR
#6865 opened Feb 28, 2026 by wxsIcey
[Refactor][EAGLE] 8/N support the merged graph for mtp
ready
read for review
ready-for-test
start test by label for PR
#6860 opened Feb 28, 2026 by slippersss
Fix: Add select_experts method to ZeroExpertFusedMoE for Ascend optim…
module:core
module:ops
#6854 opened Feb 27, 2026 by ZWJason
[300I][Bugfix] fix unquant model weight nd2nz error
module:core
#6851 opened Feb 27, 2026 by Tflowers-0129
[Bugfix]: Fix AttributeError when using MTP with sparse attention (SFA) and context parallelism
ready
read for review
ready-for-test
start test by label for PR
#6850 opened Feb 27, 2026 by Potabk
[300I] support decode-only aclgraph mode
module:core
ready
read for review
ready-for-test
start test by label for PR
#6849 opened Feb 27, 2026 by Tflowers-0129
[Test] Add e2e test cases for the Qwen-VL model adaptation to Ascend 310p
module:tests
#6845 opened Feb 27, 2026 by wanghengkang
[Perf] Optimize MTP execution by reordering state update operation
#6844 opened Feb 27, 2026 by SlightwindSec
[300I][Bugfix] fix nz unquant error
module:core
module:tests
#6843 opened Feb 27, 2026 by Tflowers-0129