Fix int8 zero-point overflow and conv bias_scale in eager quantized ref kernels by abeakkas · Pull Request #20655 · pytorch/executorch

abeakkas · 2026-06-30T22:52:08Z

Summary:
Fix two numerical bugs in the eager Cadence reference kernels (ref_implementations.py) that made them diverge from the deployed C++ kernels.

Int8 zero-point overflow: the dequant subtracted the zero-point while the tensor was still int8, so a negative zero-point could overflow and wrap. We now upcast before subtracting. Affects quantized_add, quantized_mul, quantized_linear, quantized_matmul, quantized_conv, and quantized_relu.
Conv bias_scale: quantized_conv_per_tensor added a pre-scaled bias onto an unscaled integer convolution accumulation, leaving the output off by ~1/bias_scale. We now add the integer bias pre-scale and dequantize the whole accumulation by bias_scale.

Also corrects the uint8 dtype-check error messages in quantized_add.

Differential Revision: D110220645

…ef kernels Summary: Fix two numerical bugs in the eager Cadence reference kernels (`ref_implementations.py`) that made them diverge from the deployed C++ kernels. 1. Int8 zero-point overflow: the dequant subtracted the zero-point while the tensor was still int8, so a negative zero-point could overflow and wrap. We now upcast before subtracting. Affects `quantized_add`, `quantized_mul`, `quantized_linear`, `quantized_matmul`, `quantized_conv`, and `quantized_relu`. 2. Conv bias_scale: `quantized_conv_per_tensor` added a pre-scaled bias onto an unscaled integer convolution accumulation, leaving the output off by ~`1/bias_scale`. We now add the integer bias pre-scale and dequantize the whole accumulation by `bias_scale`. Also corrects the uint8 dtype-check error messages in `quantized_add`. Differential Revision: D110220645

pytorch-bot · 2026-06-30T22:52:12Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/20655

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

✅ No Failures

As of commit 08ef0e5 with merge base d54a0c0 ():
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

meta-codesync · 2026-06-30T22:52:17Z

@abeakkas has exported this pull request. If you are a Meta employee, you can view the originating Diff in D110220645.

github-actions · 2026-06-30T22:53:00Z

This PR needs a `release notes:` label

If your change should be included in the release notes (i.e. would users of this library care about this change?), please use a label starting with release notes:. This helps us keep track and include your important work in the next release notes.

To add a label, you can comment to pytorchbot, for example
@pytorchbot label "release notes: none"

For more information, see
https://github.com/pytorch/pytorch/wiki/PyTorch-AutoLabel-Bot#why-categorize-for-release-notes-and-how-does-it-work.

meta-cla Bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Jun 30, 2026

meta-codesync Bot added the meta-exported label Jun 30, 2026

meta-codesync Bot temporarily deployed to cadence June 30, 2026 22:52 Inactive

ethansfng approved these changes Jun 30, 2026

View reviewed changes

meta-codesync Bot merged commit f95486d into pytorch:main Jul 1, 2026
192 of 198 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Fix int8 zero-point overflow and conv bias_scale in eager quantized ref kernels#20655

Fix int8 zero-point overflow and conv bias_scale in eager quantized ref kernels#20655
meta-codesync[bot] merged 1 commit into
pytorch:mainfrom
abeakkas:export-D110220645

abeakkas commented Jun 30, 2026

Uh oh!

pytorch-bot Bot commented Jun 30, 2026 •

edited

Loading

Uh oh!

meta-codesync Bot commented Jun 30, 2026

Uh oh!

github-actions Bot commented Jun 30, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Uh oh!

Conversation

abeakkas commented Jun 30, 2026

Uh oh!

pytorch-bot Bot commented Jun 30, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/20655

✅ No Failures

Uh oh!

meta-codesync Bot commented Jun 30, 2026

Uh oh!

github-actions Bot commented Jun 30, 2026

This PR needs a release notes: label

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

pytorch-bot Bot commented Jun 30, 2026 •

edited

Loading

This PR needs a `release notes:` label