Relax Transformers modeling backend MoE experts check #28952
base: main
Conversation
The experts module could now also be a 3D tensor. Note that there are other issues on Transformers main which currently break Transformers modeling backend MoE support; these are being worked on separately. Signed-off-by: Harry Mellor <[email protected]>
Documentation preview: https://vllm--28952.org.readthedocs.build/en/28952/
Code Review
This pull request relaxes the check for Mixture-of-Experts (MoE) layers in the Transformers modeling backend. It now supports identifying packed expert modules where parameters are 3D tensors, in addition to the existing check for nn.ModuleList. The documentation has been updated accordingly. My review focuses on the implementation of this new check. I've found a potential edge case where a module with no parameters could be incorrectly identified as a packed expert module and have provided a suggestion to make the check more robust.
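A minimal sketch of what such a relaxed check might look like. This is an illustration only, not the PR's actual code: the function name `is_moe_experts` and the assumption that packed experts expose parameters of shape `(num_experts, in_dim, out_dim)` are hypothetical.

```python
import torch
from torch import nn


def is_moe_experts(module: nn.Module) -> bool:
    """Heuristic check for an MoE experts container.

    Accepts either the classic layout (an nn.ModuleList of per-expert
    modules) or a "packed" layout where every parameter on the module
    is a single 3D tensor stacking all experts.
    """
    if isinstance(module, nn.ModuleList):
        return True
    params = list(module.parameters(recurse=False))
    # Guard the edge case raised in review: a module with no parameters
    # should not be mistaken for a packed experts module.
    if not params:
        return False
    return all(p.ndim == 3 for p in params)
```

The explicit empty-parameter guard is what makes the check robust: without it, `all()` over an empty list returns True and a parameter-free module would be misclassified as packed experts.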