Support power of 2 scaling factors in float8 training and use e4m3 everywhere #1670
Facebook GitHub Tools / Meta Internal-Only Changes Check
succeeded Feb 6, 2025 in 0s
There is no internal Diff connected; this can be merged now.