Skip to content

Conversation

@hmellor
Copy link
Member

@hmellor hmellor commented Nov 18, 2025

The experts module could now also be a 3D tensor.

Note that there are other issues on Transformers main which currently break Transformers modeling backend MoE support. These are being worked on separately.

The experts model could now also be a 3D tensor

Note that there are other issues on Transformers main which currently break Transformers modeling backend MoE support. These are being worked on separately.

Signed-off-by: Harry Mellor <[email protected]>
@mergify
Copy link

mergify bot commented Nov 18, 2025

Documentation preview: https://vllm--28952.org.readthedocs.build/en/28952/

@mergify mergify bot added the documentation Improvements or additions to documentation label Nov 18, 2025
@hmellor hmellor requested a review from Isotr0py November 18, 2025 17:20
Copy link
Contributor

@gemini-code-assist gemini-code-assist bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Code Review

This pull request relaxes the check for Mixture-of-Experts (MoE) layers in the Transformers modeling backend. It now supports identifying packed expert modules where parameters are 3D tensors, in addition to the existing check for nn.ModuleList. The documentation has been updated accordingly. My review focuses on the implementation of this new check. I've found a potential edge case where a module with no parameters could be incorrectly identified as a packed expert module and have provided a suggestion to make the check more robust.

Signed-off-by: Harry Mellor <[email protected]>
Signed-off-by: Harry Mellor <[email protected]>
Signed-off-by: Harry Mellor <[email protected]>
@Isotr0py Isotr0py added the ready ONLY add when PR is ready to merge/full CI is needed label Nov 19, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

documentation Improvements or additions to documentation ready ONLY add when PR is ready to merge/full CI is needed

Projects

Status: Todo

Development

Successfully merging this pull request may close these issues.

3 participants