[RESPONDED] Error waiting for DMUB idle: status=3

Ghett_Klapson · April 3, 2024, 4:05am

I have hardware.enableAllFirmware set, which should be equivalent, but I’ll try the other option

Niko_Cantero · April 5, 2024, 8:29pm

Same issue:

Status 2 though
NixOS as well, unstable, kernel 6.8.3

Niko_Cantero · April 10, 2024, 7:47pm

Did it work?

Niko_Cantero · April 10, 2024, 11:31pm

This wouldn’t work, NixOS already enables hardware.enableRedistributableFirmware via nixos-generate-config by default in:

github.com

NixOS/nixpkgs/blob/c46343615d8a970d8a687c5c63f010bc018fd45e/nixos/modules/installer/tools/nixos-generate-config.pl#L307


      
          
          # Pull in NixOS configuration for containers.
          if ($virt eq "systemd-nspawn") {
              push @attrs, "boot.isContainer = true;";
          }
          
          
          # Check if we're on bare metal, not in a VM/container.
          if ($virt eq "none") {
              # Provide firmware for devices that are not detected by this script.
              push @imports, "(modulesPath + \"/installer/scan/not-detected.nix\")";
          
              # Update the microcode.
              push @attrs, "hardware.cpu.amd.updateMicrocode = lib.mkDefault config.hardware.enableRedistributableFirmware;" if cpuManufacturer "AuthenticAMD";
              push @attrs, "hardware.cpu.intel.updateMicrocode = lib.mkDefault config.hardware.enableRedistributableFirmware;" if cpuManufacturer "GenuineIntel";
          }
          
          # For a device name like /dev/sda1, find a more stable path like
          # /dev/disk/by-uuid/X or /dev/disk/by-label/Y.
          sub findStableDevPath {
              my ($dev) = @_;

github.com

NixOS/nixpkgs/blob/master/nixos/modules/installer/scan/not-detected.nix

# Enables non-free firmware on devices not recognized by `nixos-generate-config`.
{ lib, ... }:

{
  hardware.enableRedistributableFirmware = lib.mkDefault true;
}

Zyansheep · April 11, 2024, 2:57am

Huh, I guess at this point it seems like a nixos-specific issue? Just had this happen to me today, seemingly at random, although rebooting it fixed the issue (don’t know how long for tho). Haven’t updated at all recently which is strange .

I set hardware.enableAllFirmware = true;
System: AMD Nixos24.05pre-git (Uakari), Kernel 6.7.11 + zfs

Ghett_Klapson · April 11, 2024, 3:55pm

I don’t think it did unfortunately. But I also haven’t been that diligent in logging everything since I’ve been busy and just do a reboot every time, so I can’t say the specifics. Unfortunately the last time this happened isn’t in the logs I do have.

For posterity when this happens again, current system is:
NixOS 24.05.20240408.4cba8b5 (Uakari) x86_64
Kernel 6.8.4
Built on this config commit.

enableRedistributableFirmware defaults to what enableAllFirmware is set to

Ghett_Klapson · April 15, 2024, 6:45am

Happened again today. I also realize this is a fw13 thread. I’m getting this on the fw16.

The relevant lines seem to be:

Apr 14 23:35:39 NOlaptop kwin_wayland[1896]: This plugin does not support raise()
Apr 14 23:35:39 NOlaptop kwin_wayland_wrapper[1896]: amdgpu: amdgpu_cs_ctx_create2 failed. (-13)
Apr 14 23:35:40 NOlaptop kernel: amdgpu 0000:c4:00.0: [drm] *ERROR* Error queueing DMUB command: status=2
Apr 14 23:35:41 NOlaptop kernel: amdgpu 0000:c4:00.0: [drm] *ERROR* Error queueing DMUB command: status=2

I can post the full log if necessary.

I’m on:
NixOS 24.05.20240410.1042fd8 (Uakari)
kernel 6.8.5
AMD Ryzen 7 7840HS

Jeremy_Fitzhardinge · April 22, 2024, 2:15am

I’m getting this on a Framework 13 running Fedora 40. My errors have “status=2” rather than 3 mentioned earlier in the thread. It came out of nowhere while I was doing very ordinary gnome terminal/vscode work.

Ghett_Klapson · April 25, 2024, 10:19pm

I do hope today’s BIOS update fixes this.
Edit: it does not.

Perhaps we need to reach out to support to get further steps, as I’ve been getting the error very frequently over the last few days.

Brath · April 28, 2024, 5:37pm

What you guys’ graphics memory allocation setting in BIOS?
I was having the same problem and noticed maxed out memory on integrated graphics. Changed the memory allocation from “Auto” to “Gaming” and I haven’t seen the issue since.
It’s hard to verify this was the cause, but it’s worth a try.

Ghett_Klapson · April 28, 2024, 5:40pm

Hmm, I’ll check if memory is maxxed next time this happens. What utility are you using to check?

I do have mine set to Auto as well iirc.

Brath · April 28, 2024, 11:42pm

I’m on Fedora using nvtop. It got to a point where the history graph it displays would get “choppy” and that seemed suspicious. I think Auto was allocating 500MB to graphics, now it’s allocating 4GB, I think.

Ghett_Klapson · May 8, 2024, 6:05pm

I haven’t had this issue pop up in a while (although I have been rebooting frequently due to wifi not working on resume), however last night I managed to trigger it twice when an external display device had a bad connection. And it seemed to be completely frozen, although I didn’t try to ssh to check. If I have time today I’ll try to repro.

Mildred · June 10, 2024, 11:09pm

Getting the same issue on FW13 Ryzen while testing HDMI (I can only have 640x480). After I tested the second port, I probably had a bad connection and got the issue : sluggish mouse (but not so the keyboard), maxed video memory, and flesh error message (with status=2). Will try the BIOS setting.

Mildred · June 11, 2024, 12:47am

After a BIOS upgrade (from 0.0.3.3 to 0.0.3.5), diagnosis of POST codes and fixing it by reseating the DDR, set BIOS to gaming mode again, checked nvtop to see there is plenty of video memory, I still managed to trigger the same issue connecting the HDMI expansion card to USB3 on the right…

mikeymop · June 13, 2024, 3:24am

I noticed vscode seems to trigger it.

I also get status=2.

I started using Lapce and it doesn’t appear but if i start using vs code again it triggers after a short while.

The system can paint one frame every 30s when this happens so I have to force reboot the machine.

Brit_Butler · July 12, 2024, 6:12pm

Hi @pkprotoplasm. Have you encountered this issue since using the updated firmware blob?

I’m on a recent AMD FW13 running Ubuntu 24.04 LTS and have hit this twice in the last week or so. I ask only because you seemed to upgrade on Jan 2nd and never mentioned seeing the issue again.

Brit_Butler · August 12, 2024, 9:32pm

I’m not sure how to add pressure to the AMD framework group to investigate the bug pkprotoplasm pointed out (which looks unassigned and pretty dormant), however this commit to the amd-staging branch is promising. That is released soon and resolves the issue. It hasn’t crashed on me since the Ubuntu 6.8.0-40 kernel patch but it’s been 48 hours I’m not holding my breath.

pkprotoplasm · August 17, 2024, 1:17am

Yes I was still seeing the issue, albeit somewhat rarely, before I stopped using the FW13 on a regular basis. I returned to using it for the first time in some months today after getting the upgraded screen, and wouldn’t you know it, I’m now seeing different fun amdgpu crashes.

I need stability for my productivity so my daily driver is a MBP now.

[ 3202.711724] [     C11] gmc_v11_0_process_interrupt: 146 callbacks suppressed
[ 3202.711731] [     C11] amdgpu 0000:c1:00.0: amdgpu: [mmhub] page fault (src_id:0 ring:8 vmid:3 pasid:32811)
[ 3202.711740] [     C11] amdgpu 0000:c1:00.0: amdgpu:  in process RDD Process pid 5053 thread firefox-bi:cs0 pid 5283)
[ 3202.711744] [     C11] amdgpu 0000:c1:00.0: amdgpu:   in page starting at address 0x0000800103ae6000 from client 18
[ 3202.711749] [     C11] amdgpu 0000:c1:00.0: amdgpu: MMVM_L2_PROTECTION_FAULT_STATUS:0x00303A11
[ 3202.711753] [     C11] amdgpu 0000:c1:00.0: amdgpu: 	 Faulty UTCL2 client ID: unknown (0x1d)
[ 3202.711758] [     C11] amdgpu 0000:c1:00.0: amdgpu: 	 MORE_FAULTS: 0x1
[ 3202.711763] [     C11] amdgpu 0000:c1:00.0: amdgpu: 	 WALKER_ERROR: 0x0
[ 3202.711767] [     C11] amdgpu 0000:c1:00.0: amdgpu: 	 PERMISSION_FAULTS: 0x1
[ 3202.711771] [     C11] amdgpu 0000:c1:00.0: amdgpu: 	 MAPPING_ERROR: 0x0
[ 3202.711776] [     C11] amdgpu 0000:c1:00.0: amdgpu: 	 RW: 0x0
[ 3202.711781] [     C11] amdgpu 0000:c1:00.0: amdgpu: [mmhub] page fault (src_id:0 ring:8 vmid:3 pasid:32811)
[ 3202.711786] [     C11] amdgpu 0000:c1:00.0: amdgpu:  in process RDD Process pid 5053 thread firefox-bi:cs0 pid 5283)
[ 3202.711791] [     C11] amdgpu 0000:c1:00.0: amdgpu:   in page starting at address 0x0000800103a05000 from client 18
[ 3202.711796] [     C11] amdgpu 0000:c1:00.0: amdgpu: MMVM_L2_PROTECTION_FAULT_STATUS:0x00000000
[ 3202.711800] [     C11] amdgpu 0000:c1:00.0: amdgpu: 	 Faulty UTCL2 client ID: VMC (0x0)
[ 3202.711804] [     C11] amdgpu 0000:c1:00.0: amdgpu: 	 MORE_FAULTS: 0x0
[ 3202.711808] [     C11] amdgpu 0000:c1:00.0: amdgpu: 	 WALKER_ERROR: 0x0
[ 3202.711812] [     C11] amdgpu 0000:c1:00.0: amdgpu: 	 PERMISSION_FAULTS: 0x0
[ 3202.711816] [     C11] amdgpu 0000:c1:00.0: amdgpu: 	 MAPPING_ERROR: 0x0
[ 3202.711819] [     C11] amdgpu 0000:c1:00.0: amdgpu: 	 RW: 0x0
[ 3202.711823] [     C11] amdgpu 0000:c1:00.0: amdgpu: [mmhub] page fault (src_id:0 ring:8 vmid:3 pasid:32811)
[ 3202.711827] [     C11] amdgpu 0000:c1:00.0: amdgpu:  in process RDD Process pid 5053 thread firefox-bi:cs0 pid 5283)
[ 3202.711830] [     C11] amdgpu 0000:c1:00.0: amdgpu:   in page starting at address 0x0000800103a00000 from client 18
[ 3202.711833] [     C11] amdgpu 0000:c1:00.0: amdgpu: MMVM_L2_PROTECTION_FAULT_STATUS:0x00000000
[ 3202.711836] [     C11] amdgpu 0000:c1:00.0: amdgpu: 	 Faulty UTCL2 client ID: VMC (0x0)
[ 3202.711839] [     C11] amdgpu 0000:c1:00.0: amdgpu: 	 MORE_FAULTS: 0x0
[ 3202.711841] [     C11] amdgpu 0000:c1:00.0: amdgpu: 	 WALKER_ERROR: 0x0
[ 3202.711844] [     C11] amdgpu 0000:c1:00.0: amdgpu: 	 PERMISSION_FAULTS: 0x0
[ 3202.711847] [     C11] amdgpu 0000:c1:00.0: amdgpu: 	 MAPPING_ERROR: 0x0
[ 3202.711849] [     C11] amdgpu 0000:c1:00.0: amdgpu: 	 RW: 0x0
[ 3202.711859] [     C11] amdgpu 0000:c1:00.0: amdgpu: [mmhub] page fault (src_id:0 ring:8 vmid:3 pasid:32811)
[ 3202.711862] [     C11] amdgpu 0000:c1:00.0: amdgpu:  in process RDD Process pid 5053 thread firefox-bi:cs0 pid 5283)
[ 3202.711865] [     C11] amdgpu 0000:c1:00.0: amdgpu:   in page starting at address 0x0000800103ae8000 from client 18
[ 3202.711868] [     C11] amdgpu 0000:c1:00.0: amdgpu: MMVM_L2_PROTECTION_FAULT_STATUS:0x00000000
[ 3202.711870] [     C11] amdgpu 0000:c1:00.0: amdgpu: 	 Faulty UTCL2 client ID: VMC (0x0)
[ 3202.711873] [     C11] amdgpu 0000:c1:00.0: amdgpu: 	 MORE_FAULTS: 0x0
[ 3202.711875] [     C11] amdgpu 0000:c1:00.0: amdgpu: 	 WALKER_ERROR: 0x0
[ 3202.711878] [     C11] amdgpu 0000:c1:00.0: amdgpu: 	 PERMISSION_FAULTS: 0x0
[ 3202.711881] [     C11] amdgpu 0000:c1:00.0: amdgpu: 	 MAPPING_ERROR: 0x0
[ 3202.711883] [     C11] amdgpu 0000:c1:00.0: amdgpu: 	 RW: 0x0
[ 3202.711894] [     C11] amdgpu 0000:c1:00.0: amdgpu: [mmhub] page fault (src_id:0 ring:8 vmid:3 pasid:32811)
[ 3202.711897] [     C11] amdgpu 0000:c1:00.0: amdgpu:  in process RDD Process pid 5053 thread firefox-bi:cs0 pid 5283)
[ 3202.711900] [     C11] amdgpu 0000:c1:00.0: amdgpu:   in page starting at address 0x0000800103ae6000 from client 18
[ 3202.711903] [     C11] amdgpu 0000:c1:00.0: amdgpu: MMVM_L2_PROTECTION_FAULT_STATUS:0x00000000
[ 3202.711905] [     C11] amdgpu 0000:c1:00.0: amdgpu: 	 Faulty UTCL2 client ID: VMC (0x0)
[ 3202.711908] [     C11] amdgpu 0000:c1:00.0: amdgpu: 	 MORE_FAULTS: 0x0
[ 3202.711911] [     C11] amdgpu 0000:c1:00.0: amdgpu: 	 WALKER_ERROR: 0x0
[ 3202.711913] [     C11] amdgpu 0000:c1:00.0: amdgpu: 	 PERMISSION_FAULTS: 0x0
[ 3202.711916] [     C11] amdgpu 0000:c1:00.0: amdgpu: 	 MAPPING_ERROR: 0x0
[ 3202.711918] [     C11] amdgpu 0000:c1:00.0: amdgpu: 	 RW: 0x0
[ 3202.711990] [     C11] amdgpu 0000:c1:00.0: amdgpu: [mmhub] page fault (src_id:0 ring:8 vmid:3 pasid:32811)
[ 3202.711994] [     C11] amdgpu 0000:c1:00.0: amdgpu:  in process RDD Process pid 5053 thread firefox-bi:cs0 pid 5283)
[ 3202.711997] [     C11] amdgpu 0000:c1:00.0: amdgpu:   in page starting at address 0x0000800103a05000 from client 18
[ 3202.712000] [     C11] amdgpu 0000:c1:00.0: amdgpu: MMVM_L2_PROTECTION_FAULT_STATUS:0x00303A11
[ 3202.712003] [     C11] amdgpu 0000:c1:00.0: amdgpu: 	 Faulty UTCL2 client ID: unknown (0x1d)
[ 3202.712005] [     C11] amdgpu 0000:c1:00.0: amdgpu: 	 MORE_FAULTS: 0x1
[ 3202.712008] [     C11] amdgpu 0000:c1:00.0: amdgpu: 	 WALKER_ERROR: 0x0
[ 3202.712010] [     C11] amdgpu 0000:c1:00.0: amdgpu: 	 PERMISSION_FAULTS: 0x1
[ 3202.712013] [     C11] amdgpu 0000:c1:00.0: amdgpu: 	 MAPPING_ERROR: 0x0
[ 3202.712015] [     C11] amdgpu 0000:c1:00.0: amdgpu: 	 RW: 0x0
[ 3202.712025] [     C11] amdgpu 0000:c1:00.0: amdgpu: [mmhub] page fault (src_id:0 ring:8 vmid:3 pasid:32811)
[ 3202.712028] [     C11] amdgpu 0000:c1:00.0: amdgpu:  in process RDD Process pid 5053 thread firefox-bi:cs0 pid 5283)
[ 3202.712031] [     C11] amdgpu 0000:c1:00.0: amdgpu:   in page starting at address 0x0000800103a00000 from client 18
[ 3202.712034] [     C11] amdgpu 0000:c1:00.0: amdgpu: MMVM_L2_PROTECTION_FAULT_STATUS:0x00303A11
[ 3202.712036] [     C11] amdgpu 0000:c1:00.0: amdgpu: 	 Faulty UTCL2 client ID: unknown (0x1d)
[ 3202.712039] [     C11] amdgpu 0000:c1:00.0: amdgpu: 	 MORE_FAULTS: 0x1
[ 3202.712042] [     C11] amdgpu 0000:c1:00.0: amdgpu: 	 WALKER_ERROR: 0x0
[ 3202.712044] [     C11] amdgpu 0000:c1:00.0: amdgpu: 	 PERMISSION_FAULTS: 0x1
[ 3202.712047] [     C11] amdgpu 0000:c1:00.0: amdgpu: 	 MAPPING_ERROR: 0x0
[ 3202.712049] [     C11] amdgpu 0000:c1:00.0: amdgpu: 	 RW: 0x0
[ 3202.712063] [     C11] amdgpu 0000:c1:00.0: amdgpu: [mmhub] page fault (src_id:0 ring:8 vmid:3 pasid:32811)
[ 3202.712066] [     C11] amdgpu 0000:c1:00.0: amdgpu:  in process RDD Process pid 5053 thread firefox-bi:cs0 pid 5283)
[ 3202.712069] [     C11] amdgpu 0000:c1:00.0: amdgpu:   in page starting at address 0x0000800103ae6000 from client 18
[ 3202.712072] [     C11] amdgpu 0000:c1:00.0: amdgpu: MMVM_L2_PROTECTION_FAULT_STATUS:0x00000000
[ 3202.712075] [     C11] amdgpu 0000:c1:00.0: amdgpu: 	 Faulty UTCL2 client ID: VMC (0x0)
[ 3202.712078] [     C11] amdgpu 0000:c1:00.0: amdgpu: 	 MORE_FAULTS: 0x0
[ 3202.712081] [     C11] amdgpu 0000:c1:00.0: amdgpu: 	 WALKER_ERROR: 0x0
[ 3202.712084] [     C11] amdgpu 0000:c1:00.0: amdgpu: 	 PERMISSION_FAULTS: 0x0
[ 3202.712087] [     C11] amdgpu 0000:c1:00.0: amdgpu: 	 MAPPING_ERROR: 0x0
[ 3202.712090] [     C11] amdgpu 0000:c1:00.0: amdgpu: 	 RW: 0x0
[ 3202.712101] [     C11] amdgpu 0000:c1:00.0: amdgpu: [mmhub] page fault (src_id:0 ring:8 vmid:3 pasid:32811)
[ 3202.712104] [     C11] amdgpu 0000:c1:00.0: amdgpu:  in process RDD Process pid 5053 thread firefox-bi:cs0 pid 5283)
[ 3202.712107] [     C11] amdgpu 0000:c1:00.0: amdgpu:   in page starting at address 0x0000800103ae8000 from client 18
[ 3202.712110] [     C11] amdgpu 0000:c1:00.0: amdgpu: MMVM_L2_PROTECTION_FAULT_STATUS:0x00000000
[ 3202.712113] [     C11] amdgpu 0000:c1:00.0: amdgpu: 	 Faulty UTCL2 client ID: VMC (0x0)
[ 3202.712115] [     C11] amdgpu 0000:c1:00.0: amdgpu: 	 MORE_FAULTS: 0x0
[ 3202.712118] [     C11] amdgpu 0000:c1:00.0: amdgpu: 	 WALKER_ERROR: 0x0
[ 3202.712120] [     C11] amdgpu 0000:c1:00.0: amdgpu: 	 PERMISSION_FAULTS: 0x0
[ 3202.712123] [     C11] amdgpu 0000:c1:00.0: amdgpu: 	 MAPPING_ERROR: 0x0
[ 3202.712125] [     C11] amdgpu 0000:c1:00.0: amdgpu: 	 RW: 0x0
[ 3202.712130] [     C11] amdgpu 0000:c1:00.0: amdgpu: [mmhub] page fault (src_id:0 ring:8 vmid:3 pasid:32811)
[ 3202.712133] [     C11] amdgpu 0000:c1:00.0: amdgpu:  in process RDD Process pid 5053 thread firefox-bi:cs0 pid 5283)
[ 3202.712136] [     C11] amdgpu 0000:c1:00.0: amdgpu:   in page starting at address 0x0000800103ae6000 from client 18
[ 3202.712139] [     C11] amdgpu 0000:c1:00.0: amdgpu: MMVM_L2_PROTECTION_FAULT_STATUS:0x00000000
[ 3202.712141] [     C11] amdgpu 0000:c1:00.0: amdgpu: 	 Faulty UTCL2 client ID: VMC (0x0)
[ 3202.712144] [     C11] amdgpu 0000:c1:00.0: amdgpu: 	 MORE_FAULTS: 0x0
[ 3202.712146] [     C11] amdgpu 0000:c1:00.0: amdgpu: 	 WALKER_ERROR: 0x0
[ 3202.712149] [     C11] amdgpu 0000:c1:00.0: amdgpu: 	 PERMISSION_FAULTS: 0x0
[ 3202.712151] [     C11] amdgpu 0000:c1:00.0: amdgpu: 	 MAPPING_ERROR: 0x0
[ 3202.712154] [     C11] amdgpu 0000:c1:00.0: amdgpu: 	 RW: 0x0
[ 3212.882816] [  T14981] [drm:amdgpu_job_timedout [amdgpu]] *ERROR* ring vcn_unified_0 timeout, signaled seq=24194, emitted seq=24195
[ 3212.883417] [  T14981] [drm:amdgpu_job_timedout [amdgpu]] *ERROR* Process information: process RDD Process pid 5053 thread firefox-bi:cs0 pid 5283
[ 3212.883721] [  T14981] amdgpu 0000:c1:00.0: amdgpu: GPU reset begin!
[ 3213.219870] [  T14981] [drm] Register(0) [regUVD_POWER_STATUS] failed to reach value 0x00000001 != 0x00000002n
[ 3213.467797] [  T14981] [drm] Register(0) [regUVD_RB_RPTR] failed to reach value 0x000000c0 != 0x00000080n
[ 3213.717065] [  T14981] [drm] Register(0) [regUVD_POWER_STATUS] failed to reach value 0x00000001 != 0x00000002n
[ 3213.724385] [  T14981] amdgpu 0000:c1:00.0: amdgpu: MODE2 reset
[ 3213.763181] [  T14981] amdgpu 0000:c1:00.0: amdgpu: GPU reset succeeded, trying to resume
[ 3213.764087] [  T14981] [drm] PCIE GART of 512M enabled (table at 0x000000801FD00000).
[ 3213.764207] [  T14981] [drm] VRAM is lost due to GPU reset!
[ 3213.764210] [  T14981] amdgpu 0000:c1:00.0: amdgpu: SMU is resuming...
[ 3213.766613] [  T14981] amdgpu 0000:c1:00.0: amdgpu: SMU is resumed successfully!
[ 3213.769004] [  T14981] [drm] DMUB hardware initialized: version=0x08004000
[ 3214.201329] [  T14981] [drm] kiq ring mec 3 pipe 1 q 0
[ 3214.461142] [  T14981] [drm] Register(0) [regUVD_POWER_STATUS] failed to reach value 0x00000001 != 0x00000002n
[ 3214.461382] [  T14981] amdgpu 0000:c1:00.0: [drm:jpeg_v4_0_hw_init [amdgpu]] JPEG decode initialized successfully.
[ 3214.462009] [  T14981] amdgpu 0000:c1:00.0: amdgpu: ring gfx_0.0.0 uses VM inv eng 0 on hub 0
[ 3214.462013] [  T14981] amdgpu 0000:c1:00.0: amdgpu: ring comp_1.0.0 uses VM inv eng 1 on hub 0
[ 3214.462016] [  T14981] amdgpu 0000:c1:00.0: amdgpu: ring comp_1.1.0 uses VM inv eng 4 on hub 0
[ 3214.462018] [  T14981] amdgpu 0000:c1:00.0: amdgpu: ring comp_1.2.0 uses VM inv eng 6 on hub 0
[ 3214.462020] [  T14981] amdgpu 0000:c1:00.0: amdgpu: ring comp_1.3.0 uses VM inv eng 7 on hub 0
[ 3214.462022] [  T14981] amdgpu 0000:c1:00.0: amdgpu: ring comp_1.0.1 uses VM inv eng 8 on hub 0
[ 3214.462024] [  T14981] amdgpu 0000:c1:00.0: amdgpu: ring comp_1.1.1 uses VM inv eng 9 on hub 0
[ 3214.462026] [  T14981] amdgpu 0000:c1:00.0: amdgpu: ring comp_1.2.1 uses VM inv eng 10 on hub 0
[ 3214.462029] [  T14981] amdgpu 0000:c1:00.0: amdgpu: ring comp_1.3.1 uses VM inv eng 11 on hub 0
[ 3214.462031] [  T14981] amdgpu 0000:c1:00.0: amdgpu: ring sdma0 uses VM inv eng 12 on hub 0
[ 3214.462034] [  T14981] amdgpu 0000:c1:00.0: amdgpu: ring vcn_unified_0 uses VM inv eng 0 on hub 8
[ 3214.462036] [  T14981] amdgpu 0000:c1:00.0: amdgpu: ring jpeg_dec uses VM inv eng 1 on hub 8
[ 3214.462038] [  T14981] amdgpu 0000:c1:00.0: amdgpu: ring mes_kiq_3.1.0 uses VM inv eng 13 on hub 0
[ 3214.467273] [  T14981] amdgpu 0000:c1:00.0: amdgpu: recover vram bo from shadow start
[ 3214.467283] [  T14981] amdgpu 0000:c1:00.0: amdgpu: recover vram bo from shadow done
[ 3214.467322] [  T14981] amdgpu 0000:c1:00.0: amdgpu: GPU reset(1) succeeded!
[ 3214.470941] [   T5283] [drm:amdgpu_cs_ioctl [amdgpu]] *ERROR* Failed to initialize parser -125!
[ 3214.602261] [   T5283] show_signal_msg: 173 callbacks suppressed
[ 3214.602267] [   T5283] firefox-bi:cs0[5283]: segfault at 0 ip 000055e4773da7ba sp 00007fdcc8bff9c0 error 6 in firefox-bin[ac7ba,55e47734e000+c9000] likely on CPU 1 (core 0, socket 0)
[ 3214.602283] [   T5283] Code: 41 56 53 50 48 89 fb 4c 8b 35 42 d7 03 00 49 8b 36 e8 0a b2 03 00 49 8b 36 bf 0a 00 00 00 e8 ed b2 03 00 48 89 1d 4e 09 04 00 <c7> 04 25 00 00 00 00 23 00 00 00 e8 06 00 00 00 cc cc cc cc cc cc
[ 3215.722786] [  T12823] [drm] Register(0) [regUVD_POWER_STATUS] failed to reach value 0x00000001 != 0x00000000n
[ 3215.978033] [  T12823] [drm] Register(0) [regUVD_POWER_STATUS] failed to reach value 0x00000001 != 0x00000000n
[ 3224.619321] [  T16752] [drm:amdgpu_job_timedout [amdgpu]] *ERROR* ring gfx_0.0.0 timeout, but soft recovered
[ 3234.642769] [  T14752] [drm:amdgpu_job_timedout [amdgpu]] *ERROR* ring gfx_0.0.0 timeout, but soft recovered
[ 3244.669679] [  T13792] [drm:amdgpu_job_timedout [amdgpu]] *ERROR* ring gfx_0.0.0 timeout, but soft recovered
[ 3254.696077] [  T16806] [drm:amdgpu_job_timedout [amdgpu]] *ERROR* ring gfx_0.0.0 timeout, but soft recovered
[ 3264.723926] [  T16806] [drm:amdgpu_job_timedout [amdgpu]] *ERROR* ring gfx_0.0.0 timeout, but soft recovered
[ 3270.248741] [      C1] sched: RT throttling activated
[ 3270.298947] [  T17320] usb 1-4: reset full-speed USB device number 2 using xhci_hcd
[ 3270.585638] [  T17320] usb 1-4: reset full-speed USB device number 2 using xhci_hcd
[ 3274.748853] [  T17163] [drm:amdgpu_job_timedout [amdgpu]] *ERROR* ring gfx_0.0.0 timeout, but soft recovered
[ 3284.775731] [  T17163] [drm:amdgpu_job_timedout [amdgpu]] *ERROR* ring gfx_0.0.0 timeout, but soft recovered
[ 3295.015795] [  T17163] [drm:amdgpu_job_timedout [amdgpu]] *ERROR* ring gfx_0.0.0 timeout, but soft recovered
[ 3305.042251] [  T14753] [drm:amdgpu_job_timedout [amdgpu]] *ERROR* ring gfx_0.0.0 timeout, but soft recovered
[ 3315.068875] [  T17163] [drm:amdgpu_job_timedout [amdgpu]] *ERROR* ring gfx_0.0.0 timeout, but soft recovered

lpapadakos · October 2, 2024, 9:32pm

It’s important these threads are linked to share information and workarounds

This seems to be an amdgpu issue more than it is a Framework issue? (though I didn’t have it in my older laptop with gen1 Ryzen)

Here’s my answer in the Framework 16 thread: