[CPU] [FP8 SDPA] Enable FP8 SDPA pattern match #3076

Valentine233 · 2025-09-26T08:59:59Z

Support FP8 SDPA pattern matching.
Depends on #2565.

pytorch-bot · 2025-09-26T09:00:04Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/ao/3076

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

❗ 1 Active SEVs

There is 1 currently active SEV. If your PR is affected, please view it below:

Macos CI runners unavailable

✅ No Failures

As of commit 1ad747c with merge base 233cfc1:
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

Valentine233 · 2025-10-10T02:08:12Z

@mingfeima Please help take a look at the PR, thanks!

mingfeima · 2025-10-10T02:15:52Z

test/prototype/inductor/test_qsdpa_fusion.py

+        self.q_out_scale = 1.5
+        self.k_out_scale = 1.5
+        self.attn_weights_scale = 1.5
+        self.v_out_scale = 1.5
+        self.attn_out_scale = 1.5
+        self.qk_out_scale = 1.5


Any particular reason we use the magic number 1.5? FP8QDQLinear is using 2.0.

Valentine233 · 2025-10-10

These values were just picked arbitrarily.

jerryzh168 · 2025-10-14T02:53:22Z

torchao/prototype/inductor/fx_passes/qsdpa_fusion.py

+    if qtype == torch.uint8:
+        assert zp is not None, "Zero point must be provided for uint8 dequantization"
+        return CallFunction(
+            torch.ops.quantized_decomposed.dequantize_per_tensor.default,


This is still using the old op. What is the e2e for quantized SDPA?
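
For readers following the excerpt above, here is a minimal sketch of why the uint8 branch insists on a zero point. It is not the torchao implementation: `dequantize_per_tensor_sketch` and its `is_uint8` flag are hypothetical names, the inputs are plain Python lists rather than tensors, and the FP8 branch (not shown in the excerpt) is assumed to apply the scale only.

```python
from typing import List, Optional, Sequence


def dequantize_per_tensor_sketch(
    values: Sequence[float],
    scale: float,
    zero_point: Optional[int] = None,
    is_uint8: bool = True,
) -> List[float]:
    """Hypothetical pure-Python stand-in for per-tensor dequantization."""
    if is_uint8:
        # Mirrors the assert in the excerpt: uint8 data is asymmetric,
        # so the zero point must be subtracted before scaling.
        assert zero_point is not None, (
            "Zero point must be provided for uint8 dequantization"
        )
        return [(v - zero_point) * scale for v in values]
    # Assumption: FP8 values are signed, so only the scale is applied.
    return [v * scale for v in values]


# uint8 path: the zero point maps the stored value 128 back to 0.0.
uint8_result = dequantize_per_tensor_sketch([0, 128, 255], scale=1.5, zero_point=128)

# FP8-style path (assumed): no zero point is involved.
fp8_result = dequantize_per_tensor_sketch([-2.0, 0.0, 0.5], scale=1.5, is_uint8=False)
```

The scale of 1.5 here simply reuses the arbitrary value from the test; any positive scale would illustrate the same point.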

Valentine233 marked this pull request as draft September 26, 2025 09:00

meta-cla bot added the CLA Signed label Sep 26, 2025

Valentine233 added the topic: not user facing label and removed the CLA Signed label Sep 26, 2025

meta-cla bot added the CLA Signed label Sep 26, 2025

Valentine233 added 2 commits October 10, 2025 01:33

[FP8 SDPA] Enable fp8 sdpa pattern match

438aabb

fix format

1ad747c

Valentine233 force-pushed the fp8_sdpa_pattern branch from 7fc9ea4 to 1ad747c October 10, 2025 02:05

Valentine233 marked this pull request as ready for review October 10, 2025 02:07

mingfeima approved these changes Oct 10, 2025

Valentine233 requested a review from jerryzh168 October 10, 2025 02:23

jerryzh168 reviewed Oct 14, 2025

jerryzh168 approved these changes Oct 14, 2025
