Support power of 2 scaling factors in float8 training and use e4m3 everywhere #7893
Annotations
1 error
Run script in container
Process completed with exit code 1.