This is kind of strange: I can load and run 4-bit or original models on either CPU or GPU. Things work fine on the GPU (a 4090), and I have CUDA installed. However, when I run on the CPU with this command line,
python server.py --model llama-65b-4bit-128g --wbits 4 --groupsize 128 --cpu
the model loads and I can attempt to run it. However, I get no output (it just spits back the prompt) and instead see this:
RuntimeError: t == DeviceType::CUDA INTERNAL ASSERT FAILED at "C:\Users\XXX\miniconda3\envs\textgen\lib\site-packages\torch\include\c10/cuda/impl/CUDAGuardImpl.h":25, please report a bug to PyTorch.
I get output as expected when doing the same thing without `--cpu` (for models that fit into VRAM).
I have 100 GB of RAM and am not running out of memory, and the process does not die (it just doesn't generate any text).
Anyone have a clue? Thanks!
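For what it's worth, that `CUDAGuardImpl` assert typically fires when some tensor or module is still on a CUDA device while the rest of the run is on CPU. A quick hedged sketch (not part of `server.py`; the helper name is my own) to scan a loaded model for stray CUDA tensors after a `--cpu` load:

```python
import torch

def find_cuda_tensors(model: torch.nn.Module) -> list[str]:
    """Return names of parameters/buffers still on a CUDA device.

    A single stray CUDA tensor mixed into a CPU run can trip the
    CUDAGuardImpl internal assert during generation, so an empty
    list is what you want to see when running with --cpu.
    """
    stray = []
    for name, param in model.named_parameters():
        if param.device.type == "cuda":
            stray.append(name)
    for name, buf in model.named_buffers():
        if buf.device.type == "cuda":
            stray.append(name)
    return stray

# Example: a freshly constructed module lives on CPU, so nothing is flagged.
model = torch.nn.Linear(4, 4)
print(find_cuda_tensors(model))  # []
```

If this turned up any names, something in the 4-bit loading path (e.g. the quantized matmul kernels) may be placing tensors on the GPU regardless of the `--cpu` flag, which would match the symptom of GPU runs working while CPU runs assert.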