-
Notifications
You must be signed in to change notification settings - Fork 50
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
CUDA API support: cudaHostAlloc and cudaFreeHost #1304
Comments
Not in high priority. It's highly possible that this API is called by NCCL. So once we finish the NCCL support, we might not need to support this API for now. This is because I don't find this API in vLLM source code, but in NCCL source code, I find it. |
Branch Merge Issue with The branch named Log Details appearing right after crash: [INFO] [0/60932323] unmap ptr is 4000000000, len is 1000 |
@mehryar72 Thank you! Would you please provide more detail repro step and it will be great to attach whole quark log. |
@QuarkContainer |
@mehryar72 Compiling containerd-shim v0.3.0 (https://github.com/QuarkContainer/rust-extensions.git#b3ac82d9) my test in the branch hostalloc pass as below. root@brad-MS-7D46:/var/log/quark# rm quark.log; docker run --net=host --cpus=0.8 -P --runtime=quark_d --mount type=bind,source="/home/brad/rust/Quark",target=/Quark --rm -it nvidia/cuda:12.1.0-devel-ubuntu22.04 /bin/bash ==========
|
Maybe we should make NCCL as an option for building quark. Because not all cuda users require for NCCL. |
When test with latest GPUVirtNew branch the test code fail at weired place. root@brad-MS-7D46:/Quark/target/release# LD_PRELOAD=/Quark/target/release/libcudaproxy.so /Quark/test/c/test_cudahostalloc 1024 1024 |
@mehryar72 @chengchen666 with PR #1315. The cudahostalloc works as below. root@brad-MS-7D46:/var/log/quark# rm quark.log; docker run --net=host --cpus=0.8 -P --runtime=quark_d --mount type=bind,source="/home/brad/rust/Quark",target=/Quark --rm -it nvidia/cuda:12.1.0-devel-ubuntu22.04 /bin/bash ==========
|
Need to implement cudaHostAlloc and cudaFreeHost to support vLLM.
Test case is in:
16bf3d2
To build:
nvcc -cudart shared test_cudahostalloc.cpp -o test_cudahostalloc -lcuda
To Run:
./test_cudahostalloc 1024 1024
The text was updated successfully, but these errors were encountered: