-
Notifications
You must be signed in to change notification settings - Fork 63
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Computer get stucked after several correct runs #133
Comments
out of memory,you may check your memory cache usage, when it get stucked |
@D0522J Thanks for replying. Not memory issue I guess. I change the code to the following to reduce vram.
The stuck proble still happen. I check the vram and ram using the command: Also, I record ram using tensorboard every 0.01 second, the maximum ram I use is about 24G, but I have 94GB on my computer When I check process by top command, it shows: several kworker/u64xxxx-flush process are occupying the cpu, I don't know what they are... @D0522J Any ideas? |
Inference is succeed, but failed when write files... I kill the process and rerun, same stucking at write files |
Try not to connect to your server through ssh. Try running Python Example directly on the server. I don't know why, but it works. |
@D0522J Thanks for replying. I try runing the code directly on the server, but not work. |
Hi @joe-zxh How did you install deepcompressor into your pip list. Please guide me Mine get error like this: Thank you so much |
@ pivtienduc
stupid but work 😂 |
Install this nunchaku is a pain, everytime it's updated I get trouble with install the latest version again |
@lmxyy seems v0.1.4 fix this problem. Thanks a lot for your great contribution. Is it related to this modification? Why is replacing gemm w4a4 with gemm awq solve this problem? |
Dear author, you have made a great contribution to the research of diffusion models. I have great interest on your project.
However, my computer stucked at writing files after several correct runs of nunchaku:
(not only I could not save the image in the code, but also I cannot save python files or text files)
{even fix the seed, will also reproduce the problem}
Only reboot the computer could recover, but will encounter same problem after serveral runs...
The code to reproduce the problem:
My cuda version is 12.4, torch version is 2.6.0, python version is 3.11.0, os system is ubuntu 20.04
I install nunchaku using your provided wheel:
pip install nunchaku-0.1.3+torch2.6-cp311-cp311-linux_x86_64.whl
(I also try build from source, but get simialr result)
My computer environment:

My pip environment:
I am looking forward to hearing from you, thanks.
The text was updated successfully, but these errors were encountered: