What’s you guys’ graphics memory allocation setting in BIOS?
I was having the same problem and noticed maxed out memory on integrated graphics. Changed the memory allocation from “Auto” to “Gaming” and I haven’t seen the issue since.
It’s hard to verify this was the cause, but it’s worth a try.
Hmm, I’ll check if memory is maxed next time this happens. What utility are you using to check?
I do have mine set to Auto as well iirc.
I’m on Fedora using nvtop. It got to a point where the history graph it displays would get “choppy” and that seemed suspicious. I think Auto was allocating 500MB to graphics, now it’s allocating 4GB, I think.
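For anyone who wants to check this without nvtop, here is a minimal Python sketch. It assumes the amdgpu driver exposes its `mem_info_vram_used` and `mem_info_vram_total` counters under `/sys/class/drm/cardN/device/`; the card index varies between machines, and the function names are just illustrative.

```python
from pathlib import Path


def read_vram_usage(device_dir):
    """Return (used_bytes, total_bytes) read from amdgpu's sysfs counters.

    device_dir is assumed to be something like /sys/class/drm/card0/device.
    """
    device = Path(device_dir)
    used = int((device / "mem_info_vram_used").read_text().strip())
    total = int((device / "mem_info_vram_total").read_text().strip())
    return used, total


def format_usage(used, total):
    """Format byte counts as 'used MiB / total MiB (percent)'."""
    pct = 100.0 * used / total if total else 0.0
    return f"{used / 2**20:.0f} MiB / {total / 2**20:.0f} MiB ({pct:.1f}%)"


if __name__ == "__main__":
    # Try every DRM card; entries without the amdgpu counters are skipped.
    for dev in sorted(Path("/sys/class/drm").glob("card[0-9]*/device")):
        try:
            print(dev, format_usage(*read_vram_usage(dev)))
        except (OSError, ValueError):
            continue
```

Running it before and after changing the BIOS setting should show whether the total actually changed, and running it while the problem is happening shows whether the dedicated memory is really full.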
I haven’t had this issue pop up in a while (although I have been rebooting frequently due to wifi not working on resume), however last night I managed to trigger it twice when an external display device had a bad connection. And it seemed to be completely frozen, although I didn’t try to ssh to check. If I have time today I’ll try to repro.
Getting the same issue on FW13 Ryzen while testing HDMI (I can only get 640x480). After I tested the second port, I probably had a bad connection and got the issue: sluggish mouse (but not the keyboard), maxed video memory, and flesh error message (with status=2). Will try the BIOS setting.
After a BIOS upgrade (from 0.0.3.3 to 0.0.3.5), diagnosing POST codes and fixing them by reseating the DDR, setting the BIOS to gaming mode again, and checking nvtop to see there is plenty of video memory, I still managed to trigger the same issue by connecting the HDMI expansion card to USB3 on the right…
I noticed vscode seems to trigger it.
I also get status=2.
I started using Lapce and the issue doesn’t appear, but if I start using VS Code again it triggers after a short while.
The system can paint one frame every 30s when this happens so I have to force reboot the machine.
Hi @pkprotoplasm. Have you encountered this issue since using the updated firmware blob?
I’m on a recent AMD FW13 running Ubuntu 24.04 LTS and have hit this twice in the last week or so. I ask only because you seemed to upgrade on Jan 2nd and never mentioned seeing the issue again.
I’m not sure how to add pressure to the AMD framework group to investigate the bug pkprotoplasm pointed out (which looks unassigned and pretty dormant); however, this commit to the amd-staging branch is promising. Hopefully that is released soon and resolves the issue. It hasn’t crashed on me since the Ubuntu 6.8.0-40 kernel patch, but it’s only been 48 hours, so I’m not holding my breath.
Yes I was still seeing the issue, albeit somewhat rarely, before I stopped using the FW13 on a regular basis. I returned to using it for the first time in some months today after getting the upgraded screen, and wouldn’t you know it, I’m now seeing different fun amdgpu crashes.
I need stability for my productivity so my daily driver is a MBP now.
[ 3202.711724] [ C11] gmc_v11_0_process_interrupt: 146 callbacks suppressed
[ 3202.711731] [ C11] amdgpu 0000:c1:00.0: amdgpu: [mmhub] page fault (src_id:0 ring:8 vmid:3 pasid:32811)
[ 3202.711740] [ C11] amdgpu 0000:c1:00.0: amdgpu: in process RDD Process pid 5053 thread firefox-bi:cs0 pid 5283)
[ 3202.711744] [ C11] amdgpu 0000:c1:00.0: amdgpu: in page starting at address 0x0000800103ae6000 from client 18
[ 3202.711749] [ C11] amdgpu 0000:c1:00.0: amdgpu: MMVM_L2_PROTECTION_FAULT_STATUS:0x00303A11
[ 3202.711753] [ C11] amdgpu 0000:c1:00.0: amdgpu: Faulty UTCL2 client ID: unknown (0x1d)
[ 3202.711758] [ C11] amdgpu 0000:c1:00.0: amdgpu: MORE_FAULTS: 0x1
[ 3202.711763] [ C11] amdgpu 0000:c1:00.0: amdgpu: WALKER_ERROR: 0x0
[ 3202.711767] [ C11] amdgpu 0000:c1:00.0: amdgpu: PERMISSION_FAULTS: 0x1
[ 3202.711771] [ C11] amdgpu 0000:c1:00.0: amdgpu: MAPPING_ERROR: 0x0
[ 3202.711776] [ C11] amdgpu 0000:c1:00.0: amdgpu: RW: 0x0
[ 3202.711781] [ C11] amdgpu 0000:c1:00.0: amdgpu: [mmhub] page fault (src_id:0 ring:8 vmid:3 pasid:32811)
[ 3202.711786] [ C11] amdgpu 0000:c1:00.0: amdgpu: in process RDD Process pid 5053 thread firefox-bi:cs0 pid 5283)
[ 3202.711791] [ C11] amdgpu 0000:c1:00.0: amdgpu: in page starting at address 0x0000800103a05000 from client 18
[ 3202.711796] [ C11] amdgpu 0000:c1:00.0: amdgpu: MMVM_L2_PROTECTION_FAULT_STATUS:0x00000000
[ 3202.711800] [ C11] amdgpu 0000:c1:00.0: amdgpu: Faulty UTCL2 client ID: VMC (0x0)
[ 3202.711804] [ C11] amdgpu 0000:c1:00.0: amdgpu: MORE_FAULTS: 0x0
[ 3202.711808] [ C11] amdgpu 0000:c1:00.0: amdgpu: WALKER_ERROR: 0x0
[ 3202.711812] [ C11] amdgpu 0000:c1:00.0: amdgpu: PERMISSION_FAULTS: 0x0
[ 3202.711816] [ C11] amdgpu 0000:c1:00.0: amdgpu: MAPPING_ERROR: 0x0
[ 3202.711819] [ C11] amdgpu 0000:c1:00.0: amdgpu: RW: 0x0
[ 3202.711823] [ C11] amdgpu 0000:c1:00.0: amdgpu: [mmhub] page fault (src_id:0 ring:8 vmid:3 pasid:32811)
[ 3202.711827] [ C11] amdgpu 0000:c1:00.0: amdgpu: in process RDD Process pid 5053 thread firefox-bi:cs0 pid 5283)
[ 3202.711830] [ C11] amdgpu 0000:c1:00.0: amdgpu: in page starting at address 0x0000800103a00000 from client 18
[ 3202.711833] [ C11] amdgpu 0000:c1:00.0: amdgpu: MMVM_L2_PROTECTION_FAULT_STATUS:0x00000000
[ 3202.711836] [ C11] amdgpu 0000:c1:00.0: amdgpu: Faulty UTCL2 client ID: VMC (0x0)
[ 3202.711839] [ C11] amdgpu 0000:c1:00.0: amdgpu: MORE_FAULTS: 0x0
[ 3202.711841] [ C11] amdgpu 0000:c1:00.0: amdgpu: WALKER_ERROR: 0x0
[ 3202.711844] [ C11] amdgpu 0000:c1:00.0: amdgpu: PERMISSION_FAULTS: 0x0
[ 3202.711847] [ C11] amdgpu 0000:c1:00.0: amdgpu: MAPPING_ERROR: 0x0
[ 3202.711849] [ C11] amdgpu 0000:c1:00.0: amdgpu: RW: 0x0
[ 3202.711859] [ C11] amdgpu 0000:c1:00.0: amdgpu: [mmhub] page fault (src_id:0 ring:8 vmid:3 pasid:32811)
[ 3202.711862] [ C11] amdgpu 0000:c1:00.0: amdgpu: in process RDD Process pid 5053 thread firefox-bi:cs0 pid 5283)
[ 3202.711865] [ C11] amdgpu 0000:c1:00.0: amdgpu: in page starting at address 0x0000800103ae8000 from client 18
[ 3202.711868] [ C11] amdgpu 0000:c1:00.0: amdgpu: MMVM_L2_PROTECTION_FAULT_STATUS:0x00000000
[ 3202.711870] [ C11] amdgpu 0000:c1:00.0: amdgpu: Faulty UTCL2 client ID: VMC (0x0)
[ 3202.711873] [ C11] amdgpu 0000:c1:00.0: amdgpu: MORE_FAULTS: 0x0
[ 3202.711875] [ C11] amdgpu 0000:c1:00.0: amdgpu: WALKER_ERROR: 0x0
[ 3202.711878] [ C11] amdgpu 0000:c1:00.0: amdgpu: PERMISSION_FAULTS: 0x0
[ 3202.711881] [ C11] amdgpu 0000:c1:00.0: amdgpu: MAPPING_ERROR: 0x0
[ 3202.711883] [ C11] amdgpu 0000:c1:00.0: amdgpu: RW: 0x0
[ 3202.711894] [ C11] amdgpu 0000:c1:00.0: amdgpu: [mmhub] page fault (src_id:0 ring:8 vmid:3 pasid:32811)
[ 3202.711897] [ C11] amdgpu 0000:c1:00.0: amdgpu: in process RDD Process pid 5053 thread firefox-bi:cs0 pid 5283)
[ 3202.711900] [ C11] amdgpu 0000:c1:00.0: amdgpu: in page starting at address 0x0000800103ae6000 from client 18
[ 3202.711903] [ C11] amdgpu 0000:c1:00.0: amdgpu: MMVM_L2_PROTECTION_FAULT_STATUS:0x00000000
[ 3202.711905] [ C11] amdgpu 0000:c1:00.0: amdgpu: Faulty UTCL2 client ID: VMC (0x0)
[ 3202.711908] [ C11] amdgpu 0000:c1:00.0: amdgpu: MORE_FAULTS: 0x0
[ 3202.711911] [ C11] amdgpu 0000:c1:00.0: amdgpu: WALKER_ERROR: 0x0
[ 3202.711913] [ C11] amdgpu 0000:c1:00.0: amdgpu: PERMISSION_FAULTS: 0x0
[ 3202.711916] [ C11] amdgpu 0000:c1:00.0: amdgpu: MAPPING_ERROR: 0x0
[ 3202.711918] [ C11] amdgpu 0000:c1:00.0: amdgpu: RW: 0x0
[ 3202.711990] [ C11] amdgpu 0000:c1:00.0: amdgpu: [mmhub] page fault (src_id:0 ring:8 vmid:3 pasid:32811)
[ 3202.711994] [ C11] amdgpu 0000:c1:00.0: amdgpu: in process RDD Process pid 5053 thread firefox-bi:cs0 pid 5283)
[ 3202.711997] [ C11] amdgpu 0000:c1:00.0: amdgpu: in page starting at address 0x0000800103a05000 from client 18
[ 3202.712000] [ C11] amdgpu 0000:c1:00.0: amdgpu: MMVM_L2_PROTECTION_FAULT_STATUS:0x00303A11
[ 3202.712003] [ C11] amdgpu 0000:c1:00.0: amdgpu: Faulty UTCL2 client ID: unknown (0x1d)
[ 3202.712005] [ C11] amdgpu 0000:c1:00.0: amdgpu: MORE_FAULTS: 0x1
[ 3202.712008] [ C11] amdgpu 0000:c1:00.0: amdgpu: WALKER_ERROR: 0x0
[ 3202.712010] [ C11] amdgpu 0000:c1:00.0: amdgpu: PERMISSION_FAULTS: 0x1
[ 3202.712013] [ C11] amdgpu 0000:c1:00.0: amdgpu: MAPPING_ERROR: 0x0
[ 3202.712015] [ C11] amdgpu 0000:c1:00.0: amdgpu: RW: 0x0
[ 3202.712025] [ C11] amdgpu 0000:c1:00.0: amdgpu: [mmhub] page fault (src_id:0 ring:8 vmid:3 pasid:32811)
[ 3202.712028] [ C11] amdgpu 0000:c1:00.0: amdgpu: in process RDD Process pid 5053 thread firefox-bi:cs0 pid 5283)
[ 3202.712031] [ C11] amdgpu 0000:c1:00.0: amdgpu: in page starting at address 0x0000800103a00000 from client 18
[ 3202.712034] [ C11] amdgpu 0000:c1:00.0: amdgpu: MMVM_L2_PROTECTION_FAULT_STATUS:0x00303A11
[ 3202.712036] [ C11] amdgpu 0000:c1:00.0: amdgpu: Faulty UTCL2 client ID: unknown (0x1d)
[ 3202.712039] [ C11] amdgpu 0000:c1:00.0: amdgpu: MORE_FAULTS: 0x1
[ 3202.712042] [ C11] amdgpu 0000:c1:00.0: amdgpu: WALKER_ERROR: 0x0
[ 3202.712044] [ C11] amdgpu 0000:c1:00.0: amdgpu: PERMISSION_FAULTS: 0x1
[ 3202.712047] [ C11] amdgpu 0000:c1:00.0: amdgpu: MAPPING_ERROR: 0x0
[ 3202.712049] [ C11] amdgpu 0000:c1:00.0: amdgpu: RW: 0x0
[ 3202.712063] [ C11] amdgpu 0000:c1:00.0: amdgpu: [mmhub] page fault (src_id:0 ring:8 vmid:3 pasid:32811)
[ 3202.712066] [ C11] amdgpu 0000:c1:00.0: amdgpu: in process RDD Process pid 5053 thread firefox-bi:cs0 pid 5283)
[ 3202.712069] [ C11] amdgpu 0000:c1:00.0: amdgpu: in page starting at address 0x0000800103ae6000 from client 18
[ 3202.712072] [ C11] amdgpu 0000:c1:00.0: amdgpu: MMVM_L2_PROTECTION_FAULT_STATUS:0x00000000
[ 3202.712075] [ C11] amdgpu 0000:c1:00.0: amdgpu: Faulty UTCL2 client ID: VMC (0x0)
[ 3202.712078] [ C11] amdgpu 0000:c1:00.0: amdgpu: MORE_FAULTS: 0x0
[ 3202.712081] [ C11] amdgpu 0000:c1:00.0: amdgpu: WALKER_ERROR: 0x0
[ 3202.712084] [ C11] amdgpu 0000:c1:00.0: amdgpu: PERMISSION_FAULTS: 0x0
[ 3202.712087] [ C11] amdgpu 0000:c1:00.0: amdgpu: MAPPING_ERROR: 0x0
[ 3202.712090] [ C11] amdgpu 0000:c1:00.0: amdgpu: RW: 0x0
[ 3202.712101] [ C11] amdgpu 0000:c1:00.0: amdgpu: [mmhub] page fault (src_id:0 ring:8 vmid:3 pasid:32811)
[ 3202.712104] [ C11] amdgpu 0000:c1:00.0: amdgpu: in process RDD Process pid 5053 thread firefox-bi:cs0 pid 5283)
[ 3202.712107] [ C11] amdgpu 0000:c1:00.0: amdgpu: in page starting at address 0x0000800103ae8000 from client 18
[ 3202.712110] [ C11] amdgpu 0000:c1:00.0: amdgpu: MMVM_L2_PROTECTION_FAULT_STATUS:0x00000000
[ 3202.712113] [ C11] amdgpu 0000:c1:00.0: amdgpu: Faulty UTCL2 client ID: VMC (0x0)
[ 3202.712115] [ C11] amdgpu 0000:c1:00.0: amdgpu: MORE_FAULTS: 0x0
[ 3202.712118] [ C11] amdgpu 0000:c1:00.0: amdgpu: WALKER_ERROR: 0x0
[ 3202.712120] [ C11] amdgpu 0000:c1:00.0: amdgpu: PERMISSION_FAULTS: 0x0
[ 3202.712123] [ C11] amdgpu 0000:c1:00.0: amdgpu: MAPPING_ERROR: 0x0
[ 3202.712125] [ C11] amdgpu 0000:c1:00.0: amdgpu: RW: 0x0
[ 3202.712130] [ C11] amdgpu 0000:c1:00.0: amdgpu: [mmhub] page fault (src_id:0 ring:8 vmid:3 pasid:32811)
[ 3202.712133] [ C11] amdgpu 0000:c1:00.0: amdgpu: in process RDD Process pid 5053 thread firefox-bi:cs0 pid 5283)
[ 3202.712136] [ C11] amdgpu 0000:c1:00.0: amdgpu: in page starting at address 0x0000800103ae6000 from client 18
[ 3202.712139] [ C11] amdgpu 0000:c1:00.0: amdgpu: MMVM_L2_PROTECTION_FAULT_STATUS:0x00000000
[ 3202.712141] [ C11] amdgpu 0000:c1:00.0: amdgpu: Faulty UTCL2 client ID: VMC (0x0)
[ 3202.712144] [ C11] amdgpu 0000:c1:00.0: amdgpu: MORE_FAULTS: 0x0
[ 3202.712146] [ C11] amdgpu 0000:c1:00.0: amdgpu: WALKER_ERROR: 0x0
[ 3202.712149] [ C11] amdgpu 0000:c1:00.0: amdgpu: PERMISSION_FAULTS: 0x0
[ 3202.712151] [ C11] amdgpu 0000:c1:00.0: amdgpu: MAPPING_ERROR: 0x0
[ 3202.712154] [ C11] amdgpu 0000:c1:00.0: amdgpu: RW: 0x0
[ 3212.882816] [ T14981] [drm:amdgpu_job_timedout [amdgpu]] *ERROR* ring vcn_unified_0 timeout, signaled seq=24194, emitted seq=24195
[ 3212.883417] [ T14981] [drm:amdgpu_job_timedout [amdgpu]] *ERROR* Process information: process RDD Process pid 5053 thread firefox-bi:cs0 pid 5283
[ 3212.883721] [ T14981] amdgpu 0000:c1:00.0: amdgpu: GPU reset begin!
[ 3213.219870] [ T14981] [drm] Register(0) [regUVD_POWER_STATUS] failed to reach value 0x00000001 != 0x00000002n
[ 3213.467797] [ T14981] [drm] Register(0) [regUVD_RB_RPTR] failed to reach value 0x000000c0 != 0x00000080n
[ 3213.717065] [ T14981] [drm] Register(0) [regUVD_POWER_STATUS] failed to reach value 0x00000001 != 0x00000002n
[ 3213.724385] [ T14981] amdgpu 0000:c1:00.0: amdgpu: MODE2 reset
[ 3213.763181] [ T14981] amdgpu 0000:c1:00.0: amdgpu: GPU reset succeeded, trying to resume
[ 3213.764087] [ T14981] [drm] PCIE GART of 512M enabled (table at 0x000000801FD00000).
[ 3213.764207] [ T14981] [drm] VRAM is lost due to GPU reset!
[ 3213.764210] [ T14981] amdgpu 0000:c1:00.0: amdgpu: SMU is resuming...
[ 3213.766613] [ T14981] amdgpu 0000:c1:00.0: amdgpu: SMU is resumed successfully!
[ 3213.769004] [ T14981] [drm] DMUB hardware initialized: version=0x08004000
[ 3214.201329] [ T14981] [drm] kiq ring mec 3 pipe 1 q 0
[ 3214.461142] [ T14981] [drm] Register(0) [regUVD_POWER_STATUS] failed to reach value 0x00000001 != 0x00000002n
[ 3214.461382] [ T14981] amdgpu 0000:c1:00.0: [drm:jpeg_v4_0_hw_init [amdgpu]] JPEG decode initialized successfully.
[ 3214.462009] [ T14981] amdgpu 0000:c1:00.0: amdgpu: ring gfx_0.0.0 uses VM inv eng 0 on hub 0
[ 3214.462013] [ T14981] amdgpu 0000:c1:00.0: amdgpu: ring comp_1.0.0 uses VM inv eng 1 on hub 0
[ 3214.462016] [ T14981] amdgpu 0000:c1:00.0: amdgpu: ring comp_1.1.0 uses VM inv eng 4 on hub 0
[ 3214.462018] [ T14981] amdgpu 0000:c1:00.0: amdgpu: ring comp_1.2.0 uses VM inv eng 6 on hub 0
[ 3214.462020] [ T14981] amdgpu 0000:c1:00.0: amdgpu: ring comp_1.3.0 uses VM inv eng 7 on hub 0
[ 3214.462022] [ T14981] amdgpu 0000:c1:00.0: amdgpu: ring comp_1.0.1 uses VM inv eng 8 on hub 0
[ 3214.462024] [ T14981] amdgpu 0000:c1:00.0: amdgpu: ring comp_1.1.1 uses VM inv eng 9 on hub 0
[ 3214.462026] [ T14981] amdgpu 0000:c1:00.0: amdgpu: ring comp_1.2.1 uses VM inv eng 10 on hub 0
[ 3214.462029] [ T14981] amdgpu 0000:c1:00.0: amdgpu: ring comp_1.3.1 uses VM inv eng 11 on hub 0
[ 3214.462031] [ T14981] amdgpu 0000:c1:00.0: amdgpu: ring sdma0 uses VM inv eng 12 on hub 0
[ 3214.462034] [ T14981] amdgpu 0000:c1:00.0: amdgpu: ring vcn_unified_0 uses VM inv eng 0 on hub 8
[ 3214.462036] [ T14981] amdgpu 0000:c1:00.0: amdgpu: ring jpeg_dec uses VM inv eng 1 on hub 8
[ 3214.462038] [ T14981] amdgpu 0000:c1:00.0: amdgpu: ring mes_kiq_3.1.0 uses VM inv eng 13 on hub 0
[ 3214.467273] [ T14981] amdgpu 0000:c1:00.0: amdgpu: recover vram bo from shadow start
[ 3214.467283] [ T14981] amdgpu 0000:c1:00.0: amdgpu: recover vram bo from shadow done
[ 3214.467322] [ T14981] amdgpu 0000:c1:00.0: amdgpu: GPU reset(1) succeeded!
[ 3214.470941] [ T5283] [drm:amdgpu_cs_ioctl [amdgpu]] *ERROR* Failed to initialize parser -125!
[ 3214.602261] [ T5283] show_signal_msg: 173 callbacks suppressed
[ 3214.602267] [ T5283] firefox-bi:cs0[5283]: segfault at 0 ip 000055e4773da7ba sp 00007fdcc8bff9c0 error 6 in firefox-bin[ac7ba,55e47734e000+c9000] likely on CPU 1 (core 0, socket 0)
[ 3214.602283] [ T5283] Code: 41 56 53 50 48 89 fb 4c 8b 35 42 d7 03 00 49 8b 36 e8 0a b2 03 00 49 8b 36 bf 0a 00 00 00 e8 ed b2 03 00 48 89 1d 4e 09 04 00 <c7> 04 25 00 00 00 00 23 00 00 00 e8 06 00 00 00 cc cc cc cc cc cc
[ 3215.722786] [ T12823] [drm] Register(0) [regUVD_POWER_STATUS] failed to reach value 0x00000001 != 0x00000000n
[ 3215.978033] [ T12823] [drm] Register(0) [regUVD_POWER_STATUS] failed to reach value 0x00000001 != 0x00000000n
[ 3224.619321] [ T16752] [drm:amdgpu_job_timedout [amdgpu]] *ERROR* ring gfx_0.0.0 timeout, but soft recovered
[ 3234.642769] [ T14752] [drm:amdgpu_job_timedout [amdgpu]] *ERROR* ring gfx_0.0.0 timeout, but soft recovered
[ 3244.669679] [ T13792] [drm:amdgpu_job_timedout [amdgpu]] *ERROR* ring gfx_0.0.0 timeout, but soft recovered
[ 3254.696077] [ T16806] [drm:amdgpu_job_timedout [amdgpu]] *ERROR* ring gfx_0.0.0 timeout, but soft recovered
[ 3264.723926] [ T16806] [drm:amdgpu_job_timedout [amdgpu]] *ERROR* ring gfx_0.0.0 timeout, but soft recovered
[ 3270.248741] [ C1] sched: RT throttling activated
[ 3270.298947] [ T17320] usb 1-4: reset full-speed USB device number 2 using xhci_hcd
[ 3270.585638] [ T17320] usb 1-4: reset full-speed USB device number 2 using xhci_hcd
[ 3274.748853] [ T17163] [drm:amdgpu_job_timedout [amdgpu]] *ERROR* ring gfx_0.0.0 timeout, but soft recovered
[ 3284.775731] [ T17163] [drm:amdgpu_job_timedout [amdgpu]] *ERROR* ring gfx_0.0.0 timeout, but soft recovered
[ 3295.015795] [ T17163] [drm:amdgpu_job_timedout [amdgpu]] *ERROR* ring gfx_0.0.0 timeout, but soft recovered
[ 3305.042251] [ T14753] [drm:amdgpu_job_timedout [amdgpu]] *ERROR* ring gfx_0.0.0 timeout, but soft recovered
[ 3315.068875] [ T17163] [drm:amdgpu_job_timedout [amdgpu]] *ERROR* ring gfx_0.0.0 timeout, but soft recovered
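A log like the one above is easier to compare across machines once it is condensed. The following Python sketch counts page faults per page address, timeouts per ring, and GPU resets. The regular expressions are based only on the message formats visible in this log, so other kernel versions may word things differently, and `summarize` is just an illustrative name.

```python
import re
import sys
from collections import Counter

# Patterns taken from the message formats in the log above (an assumption
# for other kernel versions, which may phrase these lines differently).
FAULT_ADDR = re.compile(r"in page starting at address (0x[0-9a-fA-F]+)")
RING_TIMEOUT = re.compile(r"ring (\S+) timeout")
RESET_BEGIN = "GPU reset begin!"


def summarize(lines):
    """Count faulting pages, ring timeouts and GPU resets in dmesg lines."""
    fault_pages = Counter()
    ring_timeouts = Counter()
    resets = 0
    for line in lines:
        fault = FAULT_ADDR.search(line)
        if fault:
            fault_pages[fault.group(1)] += 1
            continue
        timeout = RING_TIMEOUT.search(line)
        if timeout:
            ring_timeouts[timeout.group(1)] += 1
            continue
        if RESET_BEGIN in line:
            resets += 1
    return {
        "fault_pages": fault_pages,
        "ring_timeouts": ring_timeouts,
        "resets": resets,
    }


if __name__ == "__main__" and len(sys.argv) > 1:
    # Usage: save the dmesg output to a file and pass its path as argument.
    with open(sys.argv[1], encoding="utf-8", errors="replace") as fh:
        result = summarize(fh)
    for key, value in result.items():
        print(key, dict(value) if isinstance(value, Counter) else value)
```

Posting the summary alongside the raw log makes it quicker to see whether two reports involve the same ring (for example `vcn_unified_0` versus `gfx_0.0.0`).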
It’s important that these threads are linked so we can share information and workarounds.
This seems to be an amdgpu issue more than it is a Framework issue? (though I didn’t have it in my older laptop with gen1 Ryzen)
Here’s my answer in the Framework 16 thread: