-
Notifications
You must be signed in to change notification settings - Fork 570
[TorchComms] integration CI tests #1927
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
pip config --user set global.progress_bar off | ||
|
||
python -m pip install --force-reinstall --pre torch torchcomms --index-url https://download.pytorch.org/whl/nightly/cu128 | ||
./build_ncclx.sh |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
don't need this line if we're installing nightly wheels
with: | ||
runner: linux.g5.48xlarge.nvidia.gpu | ||
gpu-arch-type: cuda | ||
gpu-arch-version: "12.6" |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
when we install cuda for pytorch we use 128, shall we also use 12.8 here?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks for the comments, update to 12.8
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Overall it looks good to me and just one comment.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM, please only merge when the CI is all green, thanks!
No description provided.