Skip to content

Conversation

@wangxiyuan
Copy link
Collaborator

@wangxiyuan wangxiyuan commented Jan 30, 2026

Pick from #6310 to fix rope error for GLM

Signed-off-by: wangxiyuan <[email protected]>
@wangxiyuan wangxiyuan added ready read for review ready-for-test start test by label for PR labels Jan 30, 2026
Copy link
Contributor

@gemini-code-assist gemini-code-assist bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Code Review

This pull request cherry-picks a fix for the rotary position embedding (RoPE) operation on Ascend NPUs. The change addresses a bug in handling partial rotary embeddings where rotary_dim is smaller than head_size. The fix correctly passes self.rotary_dim instead of self.head_size to the _npu_rotary_embedding function for the sliced tensors. This ensures the underlying kernel operates on the correct dimension, which is critical for model correctness. The change is accurate and necessary.

@wangxiyuan wangxiyuan merged commit 70f0a06 into vllm-project:releases/v0.13.0 Jan 30, 2026
18 checks passed
starmountain1997 pushed a commit to starmountain1997/vllm-ascend that referenced this pull request Jan 31, 2026
SkychenLee pushed a commit to SkychenLee/vllm-ascend that referenced this pull request Jan 31, 2026
Pick from vllm-project#6310 to fix rope error for GLM

Signed-off-by: wangxiyuan <[email protected]>
Signed-off-by: l00832868 <[email protected]>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

ready read for review ready-for-test start test by label for PR

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant