You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Summary:
It's not clear whether we can write a fast dim0 + dim1 cast kernel, so
adjusting the roofline estimation formulas to use separate dim0 and dim1 kernels
Test Plan:
```
python benchmarks/float8/float8_roofline.py ~/local/tmp/20250325_b200_mxfp8_v2_triton.csv --mx_recipe_name mxfp8_cublas --shape_gen_name pow2_extended
```
Reviewers:
Subscribers:
Tasks:
Tags:
ghstack-source-id: 66a95b3
ghstack-comment-id: 2752441017
Pull Request resolved: #1953
0 commit comments