mx roofline: adjust mxfp8 formulas #1953


Merged: 23 commits, Mar 27, 2025

vkuzo (Contributor) commented Mar 25, 2025

Summary:

It is not yet clear whether we can write a fast fused dim0 + dim1 cast kernel, so
this PR adjusts the mxfp8 roofline estimation formulas to assume separate dim0 and dim1 cast kernels.
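
As a rough illustration of why this choice changes the estimate, here is a minimal sketch of a memory-bound traffic model for the two options. It is not the code in `benchmarks/float8/float8_roofline.py`; all function names are hypothetical, and the model assumes a bf16 input, 1-byte fp8 outputs, one 1-byte e8m0 scale per 32-element block, and an element count divisible by 32.

```python
# Hypothetical memory-traffic model for mxfp8 casts (illustration only,
# not the formulas used by the roofline script).
BYTES_BF16 = 2         # assumed input dtype
BYTES_FP8 = 1          # mxfp8 element
BYTES_E8M0_SCALE = 1   # one shared scale per block
MX_BLOCK_SIZE = 32     # elements per MX block


def cast_output_bytes(numel: int) -> int:
    """Bytes written by one mxfp8 cast: fp8 data plus one scale per block."""
    return numel * BYTES_FP8 + (numel // MX_BLOCK_SIZE) * BYTES_E8M0_SCALE


def separate_cast_bytes(numel: int) -> int:
    """Separate dim0 and dim1 kernels: each one re-reads the bf16 input."""
    per_kernel = numel * BYTES_BF16 + cast_output_bytes(numel)
    return 2 * per_kernel


def fused_cast_bytes(numel: int) -> int:
    """Hypothetical fused kernel: reads the input once, writes both outputs."""
    return numel * BYTES_BF16 + 2 * cast_output_bytes(numel)


def memory_bound_time_s(num_bytes: int, bandwidth_bytes_per_s: float) -> float:
    """Lower bound on runtime if the kernel is limited only by memory bandwidth."""
    return num_bytes / bandwidth_bytes_per_s
```

Under these assumptions the separate-kernel model reads the input twice, so its estimated traffic is higher than the fused model's by exactly one extra read of the bf16 input.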

Test Plan:

python benchmarks/float8/float8_roofline.py ~/local/tmp/20250325_b200_mxfp8_v2_triton.csv --mx_recipe_name mxfp8_cublas --shape_gen_name pow2_extended

Reviewers:

Subscribers:

Tasks:

Tags:

vkuzo added 17 commits March 21, 2025 06:59, each titled [ghstack-poisoned]
pytorch-bot bot commented Mar 25, 2025

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/ao/1953

Note: Links to docs will display an error until the docs builds have been completed.

This comment was automatically generated by Dr. CI and updates every 15 minutes.

facebook-github-bot added the CLA Signed label Mar 25, 2025
vkuzo added a commit that referenced this pull request Mar 25, 2025
ghstack-source-id: 66a95b3
ghstack-comment-id: 2752441017
Pull Request resolved: #1953
vkuzo added the topic: not user facing label Mar 26, 2025
vkuzo added 3 commits March 26, 2025 13:55, each titled [ghstack-poisoned]
vkuzo added 3 commits March 27, 2025 07:23, each titled [ghstack-poisoned]
vkuzo changed the base branch from gh/vkuzo/78/head to main March 27, 2025 14:24
vkuzo merged commit fe5bf73 into main Mar 27, 2025
3 checks passed
Labels: CLA Signed, topic: not user facing
3 participants