Summary:
These workarounds are no longer needed after
#2356 and the corresponding
improvements in PyTorch core.
Test Plan:
torchtitan bench on Llama 3 8B on 8 H100s:

before
  rowwise
    Median Tokens/Second (excluding step 1): 7013.0
    Max Memory Usage: 37.19 GiB
  gw_hp
    Median Tokens/Second (excluding step 1): 7232.0
    Max Memory Usage: 37.13 GiB
after
  rowwise
    Median Tokens/Second (excluding step 1): 6984.5
    Max Memory Usage: 37.19 GiB
  gw_hp
    Median Tokens/Second (excluding step 1): 7319.5
    Max Memory Usage: 37.13 GiB
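
To make the before/after comparison easier to read, the relative change in median throughput can be computed from the numbers reported above. This is a minimal sketch: the helper name `pct_change` and the `results` dictionary are illustrative and not part of torchtitan or this change. Max memory usage is identical before and after for both recipes, so only throughput is compared.

```python
def pct_change(before: float, after: float) -> float:
    """Relative change from `before` to `after`, in percent."""
    return (after - before) / before * 100.0

# Median tokens/second (excluding step 1) copied from the test plan above,
# stored as (before, after) per recipe.
results = {
    "rowwise": (7013.0, 6984.5),
    "gw_hp": (7232.0, 7319.5),
}

changes = {name: pct_change(b, a) for name, (b, a) in results.items()}

for name, change in changes.items():
    print(f"{name}: {change:+.2f}%")
# rowwise: -0.41%
# gw_hp: +1.21%
```

Both deltas are small; the commit does not report run-to-run variance, so whether they are within noise is not established here.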
Reviewers:
Subscribers:
Tasks:
Tags:
ghstack-source-id: ae11ea7
ghstack-comment-id: 3113561383
Pull Request resolved: #2595