fix(inference): Add missing dtype attribute to ParameterBase setter #7378
Conversation
Thank you @Flink-ddd!
Hi @tohtana, I wanted to provide a little more context on the companion PR for the DeepSpeed-MII project, which was a necessary prerequisite for verifying this fix: deepspeedai/DeepSpeed-MII#567. The change to setup.py in that PR was essential. During testing in a clean containerized environment, the original find_packages() call in DeepSpeed-MII's setup script failed to discover the mii package, leading to a persistent ModuleNotFoundError. Making the package discovery explicit with include=['mii', 'mii.*'] resolves this build issue, ensuring a robust installation from source. If you have a moment, I would be grateful if you could take a look. No rush at all, and thanks again for maintaining this great project!
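For reference, the packaging change described above amounts to making package discovery explicit in DeepSpeed-MII's setup.py. A simplified fragment (metadata fields omitted; only the `packages` argument reflects the actual change):

```python
# setup.py (DeepSpeed-MII) -- simplified fragment.
# Explicit include patterns prevent find_packages() from silently
# missing the 'mii' package in some clean build environments.
from setuptools import setup, find_packages

setup(
    name="deepspeed-mii",
    packages=find_packages(include=["mii", "mii.*"]),
)
```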
Signed-off-by: Vensenmu <[email protected]>
Co-authored-by: Masahiro Tanaka <[email protected]>
Description
This PR fixes an `AttributeError: 'UnembedParameter' object has no attribute 'dtype'` that occurs in the Inference V2 engine. The issue is triggered when using a high-level interface like DeepSpeed-MII to run inference on models with tied input/output embeddings, such as Llama 2.

Resolves: #7260
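For context, the failure surfaces with a minimal MII pipeline script along these lines (the model name is illustrative, and running it requires a GPU environment with deepspeed-mii installed):

```python
# Minimal reproduction sketch (illustrative; needs a GPU and deepspeed-mii).
import mii

# A model with tied input/output embeddings, e.g. Llama 2.
pipe = mii.pipeline("meta-llama/Llama-2-7b-hf")

# Before this fix, engine construction failed with:
#   AttributeError: 'UnembedParameter' object has no attribute 'dtype'
response = pipe(["DeepSpeed is"], max_new_tokens=32)
print(response)
```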
Root Cause Analysis
The root cause is that while the `ParameterBase` metaclass correctly creates property setters for parameter tensors, the setter function (`param_setter`) only assigns the tensor value itself. It does not propagate the tensor's `dtype` to the container instance.

Downstream functions, such as `flatten_inference_model`, expect every parameter container to have a `.dtype` attribute. When they encounter a custom container like `UnembedParameter` that lacks this attribute, an `AttributeError` is raised.

The Fix
The solution is to modify the `param_setter` function within `make_param_setter`, located in `deepspeed/inference/v2/model_implementations/parameter_base.py`. I have added the line `self.dtype = value.dtype` immediately after the parameter tensor is assigned. This change ensures that any object inheriting from `ParameterBase` correctly exposes the `dtype` of the tensor it wraps, resolving the error.

Verification
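The generated-setter pattern and the one-line fix can be sketched as follows. This is a simplified stand-in (no torch, no metaclass machinery): the names `make_param_setter` and `UnembedParameter` follow the PR, but the class bodies here are illustrative, not DeepSpeed's actual implementation.

```python
class FakeTensor:
    """Stand-in for torch.Tensor carrying only a dtype attribute."""
    def __init__(self, dtype):
        self.dtype = dtype

def make_param_getter(name):
    attr = "_" + name
    def param_getter(self):
        return getattr(self, attr)
    return param_getter

def make_param_setter(name):
    attr = "_" + name
    def param_setter(self, value):
        setattr(self, attr, value)  # original behavior: store the tensor
        self.dtype = value.dtype    # the fix: propagate the tensor's dtype
    return param_setter

class UnembedParameter:
    """Container whose 'params' attribute is managed by generated accessors."""
    params = property(make_param_getter("params"), make_param_setter("params"))

p = UnembedParameter()
p.params = FakeTensor("float16")
# Downstream code (e.g. something like flatten_inference_model) can now
# read p.dtype instead of raising AttributeError.
print(p.dtype)  # float16
```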
This fix has been verified in a containerized GPU environment (RunPod with PyTorch 2.1). The verification process involved:

1. Cloning both the `deepspeed` and `DeepSpeed-MII` repositories from source.
2. Installing the modified `deepspeed` library from this branch.
3. Installing the `DeepSpeed-MII` library (with a packaging fix) to trigger the bug.
4. Running an end-to-end inference script with `mii.pipeline` and a standard language model.

The logs confirm that with this fix, the program executes past the original point of failure. The `AttributeError` is resolved, and the DeepSpeed engine proceeds to the model loading phase.

(Note: a full end-to-end run in the test environment was blocked by a separate, pre-existing build issue in DeepSpeed's op builder (`ModuleNotFoundError: dskernels`), which is unrelated to this fix. Progression past the original error point demonstrates that the fix is effective.)

Related Context
This bug is primarily triggered via the DeepSpeed-MII project. A companion PR, deepspeedai/DeepSpeed-MII#567, has been submitted to fix a packaging issue in that repository that was a prerequisite for this verification.
Output: screenshots of the run, showing the engine progressing past the original error, are attached to the PR.