amdgpu system freeze #54052

Open
CaioFrancisco opened this issue Jan 20, 2025 · 5 comments
Labels
bug (Something isn't working), needs-testing (Testing a PR or reproducing an issue needed)

Comments

@CaioFrancisco
CaioFrancisco commented Jan 20, 2025

Is this a new report?

No

System Info

Void 6.12.9_1 x86_64 AuthenticAMD notuptodate rrrmFFFFFF

Package(s) Affected

linux-firmware-amd-20250109_1

Does a report exist for this bug with the project's home (upstream) and/or another distro?

I'm unsure.

Expected behaviour

This issue is a dupe of #53787. I'm still suffering from the freezes even though the OP of that issue said their crashes had stopped. I asked them to re-open the issue, but it's been a week and I'm unsure if they'll do it. At any rate, there have been no updates since my first post.

If this isn't resolved, I'll be switching distros soon enough and I'm probably not coming back. What's the use of a computer that constantly crashes on me?

Actual behaviour

Quoting myself from my previous post,

currently running XFCE on X11, i also ran kde plasma X11 some time ago, which also crashed just the same way. weirdly enough, it might just be luck, but kde plasma wayland never crashed on me.

my system specs are a ryzen 5 2400g and an nvidia GTX 1650 GPU, and the crash can happen randomly. i can sometimes do video intensive tasks for hours without a single hiccup, but sometimes i can crash 10 minutes after booting up while using my browser.

as a last note, the dmesg logs have some errors when the system "freezes" (i can still ssh my way in with my phone). they are usually pretty quiet up until something like this happens:

[  422.608105] amdgpu 0000:08:00.0: amdgpu: failed to write reg 28b4 wait reg 28c6
[  422.876854] [drm:amdgpu_dm_atomic_check [amdgpu]] *ERROR* [CRTC:73:crtc-0] hw_done or flip_done timed out
[  433.117233] [drm:amdgpu_dm_atomic_check [amdgpu]] *ERROR* [CRTC:77:crtc-1] hw_done or flip_done timed out
[  434.689642] amdgpu 0000:08:00.0: amdgpu: failed to write reg 1a6f4 wait reg 1a706
[  454.621453] amdgpu 0000:08:00.0: amdgpu: Dumping IP State

followed by the watchdog freaking out at the CPU threads getting stuck until i reisub.
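
For anyone trying to confirm they are hitting the same freeze, here is a minimal sketch that picks the relevant lines out of saved dmesg output. The patterns are copied from the excerpt above and are illustrative, not an exhaustive list of amdgpu errors; the function name `find_amdgpu_errors` is hypothetical.

```python
import re

# Patterns taken from the log excerpt in this report (illustrative only).
AMDGPU_PATTERNS = [
    re.compile(r"amdgpu: failed to write reg"),
    re.compile(r"hw_done or flip_done timed out"),
    re.compile(r"amdgpu: Dumping IP State"),
]

# dmesg lines look like "[  422.608105] message".
TIMESTAMP = re.compile(r"^\[\s*(\d+\.\d+)\]\s*(.*)$")


def find_amdgpu_errors(dmesg_text):
    """Return (timestamp_seconds, message) for each line matching a pattern."""
    hits = []
    for line in dmesg_text.splitlines():
        m = TIMESTAMP.match(line)
        if not m:
            continue  # skip lines without a leading timestamp
        stamp, message = float(m.group(1)), m.group(2)
        if any(p.search(message) for p in AMDGPU_PATTERNS):
            hits.append((stamp, message))
    return hits
```

One way to use it is to save the output of `dmesg` to a file over ssh while the desktop is frozen, then pass the file contents to `find_amdgpu_errors`.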

Steps to reproduce

  1. Use your device for a random amount of time.
  2. Freeze.
CaioFrancisco added the bug and needs-testing labels Jan 20, 2025
@Laitinlok
I have the same freezes on openSUSE Tumbleweed.

@CaioFrancisco
Author

By the way, I won't be able to provide logs anymore. I switched to Fedora for the time being. The crashes have stopped happening in this distro.

@CaioFrancisco
Author

CaioFrancisco commented Jan 22, 2025

Well, that was short. Soon after I installed mesa, the freeze happened again, on Fedora. After one DuckDuckGo search about amdgpu and mesa, I found a massive thread about this issue. A lot of people are reporting success with this patch, though. I'll re-install Void and try it out myself.

@ACR-Jeff
Contributor

Hope this will be of some help. A few weeks ago I had the same issues and posted an issue that got no solution. I continued testing: as a precaution I had Timeshift backups from before and after updating, and I held back one package at a time until the freezes stopped. Holding libplacebo back on a previous version and updating everything else seemed to solve it on my end for the time being. Later that same day a revert to mesa-24.2.8_2 was pushed, and I followed through with it. I am not sure whether that solved the freezes or whether it was the updated/downgraded libplacebo; I don't remember. I also added amdgpu.ppfeaturemask=0xffffffff to GRUB_CMDLINE_LINUX_DEFAULT in my GRUB configuration and updated GRUB. All seems to work now.
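
If you try the kernel parameter mentioned above, it may help to confirm that it actually reached the running kernel after regenerating the GRUB config and rebooting. A minimal sketch, assuming the command line is read from `/proc/cmdline`; the helper name `get_kernel_param` is hypothetical.

```python
def get_kernel_param(cmdline, name):
    """Return the value of `name` from a kernel command line, or None if absent."""
    value = None
    for token in cmdline.split():
        # Split on the first "=" only, so values containing "=" stay intact.
        key, sep, val = token.partition("=")
        if key == name and sep:
            value = val  # keep the last occurrence found
    return value
```

For example, `get_kernel_param(open("/proc/cmdline").read(), "amdgpu.ppfeaturemask")` returns the mask string if the parameter is present and `None` otherwise. The sketch splits on whitespace only, so it does not handle quoted values containing spaces.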

@Laitinlok

It seems like the frame buffer for the iGPU is too small, which causes the crashes. Try setting the UMA buffer size to 2 GB.
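
Before and after changing the UMA buffer size in the firmware setup, the amount of VRAM the driver sees can be checked from a running system. A rough sketch, assuming the amdgpu driver exposes `mem_info_vram_total` (in bytes) under `/sys/class/drm/card*/device/`; the function names are hypothetical and the path may differ on your system.

```python
import glob


def read_vram_totals(pattern="/sys/class/drm/card*/device/mem_info_vram_total"):
    """Map each matching sysfs file to the VRAM size it reports, in bytes."""
    totals = {}
    for path in sorted(glob.glob(pattern)):
        try:
            with open(path) as f:
                totals[path] = int(f.read().strip())
        except (OSError, ValueError):
            continue  # unreadable file or unexpected content: skip this card
    return totals


def to_mib(num_bytes):
    """Convert a byte count to MiB."""
    return num_bytes / (1024 * 1024)
```

Printing `to_mib(size)` for each entry of `read_vram_totals()` shows whether the new buffer size took effect. An empty result means no matching file was found.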
