As per title. I am using the onboard GPU. My system decided to freeze. This started happening a couple of weeks ago with little thing, like my text editor freezing, or short browser freezes. I didn’t make much of it as I am, by now, used to see almost everything fail… anyway. Today my FW16 froze complete, for a few seconds, black screen, then the display came back but dimmed (I have auto dimming of as that is just completely useless and overactive). It seems, according to dmesg that the gpu has issues? [949897.015157] usb 1-4.1: reset full-speed USB device number 7 using xhci_hcd
[949906.620079] usb 1-4.1: reset full-speed USB device number 7 using xhci_hcd
[949910.388247] usb 1-4.1: reset full-speed USB device number 7 using xhci_hcd
[953387.075248] amdgpu 0000:c1:00.0: amdgpu: Dumping IP State
[953387.077961] amdgpu 0000:c1:00.0: amdgpu: Dumping IP State Completed
[953387.088028] amdgpu 0000:c1:00.0: amdgpu: ring gfx_0.0.0 timeout, signaled seq=100822092, emitted seq=100822094
[953387.088033] amdgpu 0000:c1:00.0: amdgpu: Process information: process gnome-shell pid 1610 thread gnome-shel:cs0 pid 1623
[953389.091867] amdgpu 0000:c1:00.0: amdgpu: MES failed to respond to msg=RESET
[953389.091874] [drm:amdgpu_mes_reset_legacy_queue [amdgpu]] ERROR failed to reset legacy queue
[953389.092158] amdgpu 0000:c1:00.0: amdgpu: GPU reset begin!
[953391.129736] amdgpu 0000:c1:00.0: amdgpu: MES failed to respond to msg=REMOVE_QUEUE
[953391.129744] [drm:amdgpu_mes_unmap_legacy_queue [amdgpu]] ERROR failed to unmap legacy queue
[953391.342934] [drm:gfx_v11_0_cp_gfx_enable.isra.0 [amdgpu]] ERROR failed to halt cp gfx
[953391.344583] amdgpu 0000:c1:00.0: amdgpu: MODE2 reset
[953391.384251] amdgpu 0000:c1:00.0: amdgpu: GPU reset succeeded, trying to resume
[953391.384992] [drm] PCIE GART of 512M enabled (table at 0x00000080FFD00000).
[953391.385049] amdgpu 0000:c1:00.0: amdgpu: SMU is resuming…
[953391.386998] amdgpu 0000:c1:00.0: amdgpu: SMU is resumed successfully!
[953391.394433] [drm] DMUB hardware initialized: version=0x08004D00
[953392.246251] amdgpu 0000:c1:00.0: amdgpu: ring gfx_0.0.0 uses VM inv eng 0 on hub 0
[953392.246264] amdgpu 0000:c1:00.0: amdgpu: ring comp_1.0.0 uses VM inv eng 1 on hub 0
[953392.246268] amdgpu 0000:c1:00.0: amdgpu: ring comp_1.1.0 uses VM inv eng 4 on hub 0
[953392.246270] amdgpu 0000:c1:00.0: amdgpu: ring comp_1.2.0 uses VM inv eng 6 on hub 0
[953392.246273] amdgpu 0000:c1:00.0: amdgpu: ring comp_1.3.0 uses VM inv eng 7 on hub 0
[953392.246275] amdgpu 0000:c1:00.0: amdgpu: ring comp_1.0.1 uses VM inv eng 8 on hub 0
[953392.246277] amdgpu 0000:c1:00.0: amdgpu: ring comp_1.1.1 uses VM inv eng 9 on hub 0
[953392.246279] amdgpu 0000:c1:00.0: amdgpu: ring comp_1.2.1 uses VM inv eng 10 on hub 0
[953392.246282] amdgpu 0000:c1:00.0: amdgpu: ring comp_1.3.1 uses VM inv eng 11 on hub 0
[953392.246285] amdgpu 0000:c1:00.0: amdgpu: ring sdma0 uses VM inv eng 12 on hub 0
[953392.246288] amdgpu 0000:c1:00.0: amdgpu: ring vcn_unified_0 uses VM inv eng 0 on hub 8
[953392.246290] amdgpu 0000:c1:00.0: amdgpu: ring jpeg_dec uses VM inv eng 1 on hub 8
[953392.246293] amdgpu 0000:c1:00.0: amdgpu: ring mes_kiq_3.1.0 uses VM inv eng 13 on hub 0
[953392.249033] amdgpu 0000:c1:00.0: amdgpu: GPU reset(2) succeeded!
[953392.283827] [drm:amdgpu_cs_ioctl [amdgpu]] ERROR Failed to initialize parser -125!
[953417.221745] usb 1-4.1: reset full-speed USB device number 7 using xhci_hcd
[953417.395540] usb 1-4.1: reset full-speed USB device number 7 using xhci_hcd
Completely understandable. I love this new laptop but getting it to work reliably has been a real adventure; there are 3 separate big issues which all need some kind of workaround and it took a pretty significant length of time to sort it all out.
If reinstalling doesn’t solve it completely, I also found this kernel which seems to solve the GPU issues completely:
To be clear, it hasn’t been unreliable up until a few weeks ago, where it would seemingly randomly freeze. I am sorry to say, but all “technology” is failing quicker than a train leaves a station.
1: My 5y old LG tv is stuck on webos 4, can’t install recent apps.
2: My homepod suffers from loud pops.
3: My 1y old car needs constant updates to keep it driveable.
4: And now my 1y old framework laptop becomes unreliable.
it’s all crap, just crap and nothing else.
Anyway, with that out of my system, I reinstalled it with ubuntu 25.04. Runs fine for a week or so, than a complete lockup today, I made a video of it. Had to turn it off and on again. This is from dmesg, the laptop was not connected to a charger at the time it froze, was about 75%:
[ 12.687074] tee tee0: Direct firmware load for /amdtee/f29bb3d9-bd66-5441-afb88acc2b2b60d6.bin failed with error -2
[ 12.687084] failed to load firmware /amdtee/f29bb3d9-bd66-5441-afb88acc2b2b60d6.bin
[ 12.687090] failed to copy TA binary
[ 12.687095] Failed to open TEE session err:0x0, rc:-12
[ 12.687103] amd-pmf AMDI0102:00: Failed to open TA session (-12)
[ 12.687119] amd-pmf AMDI0102:00: registered PMF device successfully
[ 12.688811] cros_ec_lpcs cros_ec_lpcs.0: Chrome EC device registered
[ 12.691376] [drm] Initialized amdxdna_accel_driver 0.0.0 for 0000:c2:00.1 on minor 0
[ 12.717643] usbcore: registered new interface driver btusb
[ 12.723595] snd_hda_intel 0000:c1:00.1: enabling device (0000 -> 0002)
[ 12.723693] snd_hda_intel 0000:c1:00.1: Handle vga_switcheroo audio client
[ 12.723791] snd_hda_intel 0000:c1:00.6: enabling device (0000 -> 0002)
[ 12.728050] cros-charge-control cros-charge-control.5.auto: Framework charge control detected, preventing load
[ 12.728748] mt7921e 0000:01:00.0: enabling device (0000 -> 0002)
[ 12.728802] kvm_amd: TSC scaling supported
[ 12.728805] kvm_amd: Nested Virtualization enabled
[ 12.728806] kvm_amd: Nested Paging enabled
[ 12.728808] kvm_amd: LBR virtualization supported
[ 12.728816] kvm_amd: Virtual GIF supported
[ 12.728817] kvm_amd: Virtual NMI enabled
[ 12.736389] mt7921e 0000:01:00.0: ASIC revision: 79220010
[ 12.736840] cros-usbpd-charger cros-usbpd-charger.6.auto: No USB PD charging ports found
[ 12.738276] cros-usbpd-charger cros-usbpd-charger.6.auto: Unexpected number of charge port count
[ 12.738281] cros-usbpd-charger cros-usbpd-charger.6.auto: Failing probe (err:0xffffffb9)
[ 12.738285] cros-usbpd-charger cros-usbpd-charger.6.auto: probe with driver cros-usbpd-charger failed with error -71
Completely agree. I’ve now had the GPU lockup again, on the AMD staging kernel. Also wireless flakiness I was experiencing has come back even though I disabled power saving (which I thought was the solution).
IDK man. I filed bugs about both, we’ll see how it works. Framework support doesn’t seem real confidence inspiring in terms of helping to solve the issues either. I think for the wireless thing, I may just wind up purchasing a non Mediatek card, I’ve heard from people who’ve done that and it actually addressed the issue.
If you do find a solution to the GPU thing, let me know, I’d love to try it out.
tgnome-shell[1958]: Connection to xwayland lost
gnome-shell[1958]: Xwayland terminated, exiting since it was mandatory
gnome-shell[1958]: JS ERROR: Gio.IOErrorEnum: Xwayland exited unexpectedly