
Conversation

drisspg (Contributor) commented Oct 13, 2025

Summary

  # Float8 tensorwise (default)
  python benchmarks/float8/float8_inference_roofline.py benchmarks/data/output.csv --recipe_name=tensorwise --do_benchmarks=False

  # Float8 rowwise
  python benchmarks/float8/float8_inference_roofline.py benchmarks/data/output.csv --recipe_name=rowwise --do_benchmarks=False

  # MX format
  python benchmarks/float8/float8_inference_roofline.py benchmarks/data/output.csv --recipe_name=mxfp8 --do_benchmarks=False

  # NVFP4
  python benchmarks/float8/float8_inference_roofline.py benchmarks/data/output.csv --recipe_name=nvfp4 --do_benchmarks=False


pytorch-bot bot commented Oct 13, 2025

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/ao/3167

Note: Links to docs will display an error until the docs builds have been completed.

❗ 1 Active SEV

There is 1 currently active SEV. If your PR is affected, please view it below:

✅ No Failures

As of commit 83ce813 with merge base 596da93:
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

@meta-cla bot added the CLA Signed label on Oct 13, 2025. (This label is managed by the Facebook bot; authors need to sign the CLA before a PR can be reviewed.)
@drisspg force-pushed the inference-nums branch 2 times, most recently from d2a0328 to 39739d2, on October 13, 2025 17:53
@drisspg requested a review from vkuzo on October 13, 2025 17:57
@drisspg added the topic: not user facing label on Oct 13, 2025. (Use this tag if you don't want this PR to show up in release notes.)
vkuzo (Contributor) commented Oct 13, 2025

drisspg (Contributor, Author) commented Oct 13, 2025

Whoops, didn't see that one. Yeah, I can move it over.

# kernel 1: x_bf16 -> x_nvfp4 (per-tensor scaling for inference)
kernel_1_rw = BYTES_PER_EL_BF16 * numel + BYTES_PER_EL_FLOAT4 * numel
# add minimal scaling overhead (per-tensor scale)
kernel_1_rw += BYTES_PER_EL_FLOAT32  # single per-tensor scale factor
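As a sanity check, the byte count above can be evaluated for a concrete tensor size. The sketch below is a standalone restatement of the snippet, not code from the PR; the byte-width constants are assumptions matching each dtype's element width (bf16 = 2 bytes, fp4 = 0.5 bytes, fp32 = 4 bytes):

```python
# Assumed element widths, in bytes.
BYTES_PER_EL_BF16 = 2
BYTES_PER_EL_FLOAT4 = 0.5
BYTES_PER_EL_FLOAT32 = 4


def kernel_1_bytes(numel: int) -> float:
    """Bytes moved by the bf16 -> nvfp4 quantization kernel (per-tensor scale)."""
    # Read the bf16 input, write the fp4 output.
    rw = BYTES_PER_EL_BF16 * numel + BYTES_PER_EL_FLOAT4 * numel
    # Write a single fp32 per-tensor scale factor.
    rw += BYTES_PER_EL_FLOAT32
    return rw


# A 4096x4096 activation: 2 B/el read + 0.5 B/el write + one fp32 scale.
print(kernel_1_bytes(4096 * 4096))  # 41943044.0 bytes (~40 MiB)
```

For realistic sizes the fixed 4-byte scale write is negligible; the traffic is dominated by the 2.5 bytes moved per element.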
Contributor
nit: add the blockwise scaling write here too, since we are adding it for mxfp8? Just to be consistent; I don't think it will change the numbers that much.
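The suggested change could be sketched as below. This is a hypothetical illustration, not the PR's code: the block size of 16 elements per fp8 scale for nvfp4, and the 1-byte scale width, are assumptions; `kernel_1_rw_nvfp4` is an illustrative helper name.

```python
# Assumed element widths, in bytes.
BYTES_PER_EL_BF16 = 2
BYTES_PER_EL_FLOAT4 = 0.5
BYTES_PER_EL_FLOAT8 = 1
BYTES_PER_EL_FLOAT32 = 4
NVFP4_BLOCK_SIZE = 16  # assumed elements covered by each blockwise scale


def kernel_1_rw_nvfp4(numel: int) -> float:
    """Bytes moved by bf16 -> nvfp4 quantization, including blockwise scale writes."""
    # Read the bf16 input, write the fp4 output.
    rw = BYTES_PER_EL_BF16 * numel + BYTES_PER_EL_FLOAT4 * numel
    # Write a single fp32 per-tensor scale factor.
    rw += BYTES_PER_EL_FLOAT32
    # Also write one fp8 scale per block, mirroring the mxfp8 accounting.
    rw += BYTES_PER_EL_FLOAT8 * (numel // NVFP4_BLOCK_SIZE)
    return rw
```

For a 4096x4096 tensor the blockwise scales add 16,777,216 / 16 = 1,048,576 bytes (1 MiB) on top of roughly 40 MiB of element traffic, about 2.5% more, consistent with the reviewer's point that it won't change the numbers much.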

vkuzo (Contributor) left a comment:

nice!

@drisspg force-pushed the inference-nums branch 2 times, most recently from 201de29 to e307ded, on October 14, 2025 04:05
