Another data point here.
-
FW 13 AMD 7640u, 64GB RAM (g.skill), BIOS 3.02. Arrived today. Lovely machine.
-
Installed to encrypted SSD from Fedora KDE ISO.
-
First boot after install, login screen working. Logged in and updated via the terminal. Rebooted.
-
On boot after update (6.5.7-300 kernel), freezes at blank screen after unlocking encrypted disk. Manually switching to a console using Ctrl+Alt+F2, logging in, and entering
sudo service sddm restart
brings up a working graphical login screen. -
Rebooting using the backup 6.5.2 kernel does not resolve the issue.
-
I have added an extract of the relevant
dmesg
output below, starting as soon as the GPU starts having issues.
[ 17.335441] [drm:mes_v11_0_submit_pkt_and_poll_completion.constprop.0 [amdgpu]] *ERROR* MES failed to response msg=14
[ 17.335726] [drm:amdgpu_mes_reg_write_reg_wait [amdgpu]] *ERROR* failed to reg_write_reg_wait
[ 17.465188] [drm:mes_v11_0_submit_pkt_and_poll_completion.constprop.0 [amdgpu]] *ERROR* MES failed to response msg=14
[ 17.465341] [drm:amdgpu_mes_reg_write_reg_wait [amdgpu]] *ERROR* failed to reg_write_reg_wait
[ 17.605605] [drm:mes_v11_0_submit_pkt_and_poll_completion.constprop.0 [amdgpu]] *ERROR* MES failed to response msg=14
[ 17.605858] [drm:amdgpu_mes_reg_write_reg_wait [amdgpu]] *ERROR* failed to reg_write_reg_wait
[ 17.735282] [drm:mes_v11_0_submit_pkt_and_poll_completion.constprop.0 [amdgpu]] *ERROR* MES failed to response msg=14
[ 17.735429] [drm:amdgpu_mes_reg_write_reg_wait [amdgpu]] *ERROR* failed to reg_write_reg_wait
[ 27.270500] [drm:amdgpu_job_timedout [amdgpu]] *ERROR* ring sdma0 timeout, signaled seq=37, emitted seq=39
[ 27.271068] [drm:amdgpu_job_timedout [amdgpu]] *ERROR* Process information: process pid 0 thread pid 0
[ 27.271566] amdgpu 0000:c1:00.0: amdgpu: GPU reset begin!
[ 27.450991] [drm:mes_v11_0_submit_pkt_and_poll_completion.constprop.0 [amdgpu]] *ERROR* MES failed to response msg=3
[ 27.451154] [drm:amdgpu_mes_unmap_legacy_queue [amdgpu]] *ERROR* failed to unmap legacy queue
[ 27.580713] [drm:mes_v11_0_submit_pkt_and_poll_completion.constprop.0 [amdgpu]] *ERROR* MES failed to response msg=3
[ 27.580861] [drm:amdgpu_mes_unmap_legacy_queue [amdgpu]] *ERROR* failed to unmap legacy queue
[ 27.710285] [drm:mes_v11_0_submit_pkt_and_poll_completion.constprop.0 [amdgpu]] *ERROR* MES failed to response msg=3
[ 27.710440] [drm:amdgpu_mes_unmap_legacy_queue [amdgpu]] *ERROR* failed to unmap legacy queue
[ 27.839960] [drm:mes_v11_0_submit_pkt_and_poll_completion.constprop.0 [amdgpu]] *ERROR* MES failed to response msg=3
[ 27.840108] [drm:amdgpu_mes_unmap_legacy_queue [amdgpu]] *ERROR* failed to unmap legacy queue
[ 27.969633] [drm:mes_v11_0_submit_pkt_and_poll_completion.constprop.0 [amdgpu]] *ERROR* MES failed to response msg=3
[ 27.969796] [drm:amdgpu_mes_unmap_legacy_queue [amdgpu]] *ERROR* failed to unmap legacy queue
[ 28.099342] [drm:mes_v11_0_submit_pkt_and_poll_completion.constprop.0 [amdgpu]] *ERROR* MES failed to response msg=3
[ 28.099496] [drm:amdgpu_mes_unmap_legacy_queue [amdgpu]] *ERROR* failed to unmap legacy queue
[ 28.229010] [drm:mes_v11_0_submit_pkt_and_poll_completion.constprop.0 [amdgpu]] *ERROR* MES failed to response msg=3
[ 28.229166] [drm:amdgpu_mes_unmap_legacy_queue [amdgpu]] *ERROR* failed to unmap legacy queue
[ 28.358705] [drm:mes_v11_0_submit_pkt_and_poll_completion.constprop.0 [amdgpu]] *ERROR* MES failed to response msg=3
[ 28.358855] [drm:amdgpu_mes_unmap_legacy_queue [amdgpu]] *ERROR* failed to unmap legacy queue
[ 28.488417] [drm:mes_v11_0_submit_pkt_and_poll_completion.constprop.0 [amdgpu]] *ERROR* MES failed to response msg=3
[ 28.488565] [drm:amdgpu_mes_unmap_legacy_queue [amdgpu]] *ERROR* failed to unmap legacy queue
[ 28.490484] amdgpu 0000:c1:00.0: amdgpu: MODE2 reset
[ 28.518860] amdgpu 0000:c1:00.0: amdgpu: GPU reset succeeded, trying to resume
[ 28.519438] [drm] PCIE GART of 512M enabled (table at 0x000000801FD00000).
[ 28.519629] amdgpu 0000:c1:00.0: amdgpu: SMU is resuming...
[ 28.521271] amdgpu 0000:c1:00.0: amdgpu: SMU is resumed successfully!
[ 28.523313] [drm] DMUB hardware initialized: version=0x08001E00
[ 28.528020] [drm] REG_WAIT timeout 1us * 1000 tries - dcn314_dsc_pg_control line:264
[ 28.530527] [drm] REG_WAIT timeout 1us * 1000 tries - dcn314_dsc_pg_control line:272
[ 28.533027] [drm] REG_WAIT timeout 1us * 1000 tries - dcn314_dsc_pg_control line:280
[ 28.535529] [drm] REG_WAIT timeout 1us * 1000 tries - dcn314_dsc_pg_control line:288
[ 28.543362] [drm] REG_WAIT timeout 1us * 1000 tries - dcn314_dsc_pg_control line:264
[ 28.545869] [drm] REG_WAIT timeout 1us * 1000 tries - dcn314_dsc_pg_control line:272
[ 28.548371] [drm] REG_WAIT timeout 1us * 1000 tries - dcn314_dsc_pg_control line:280
[ 28.550872] [drm] REG_WAIT timeout 1us * 1000 tries - dcn314_dsc_pg_control line:288
[ 28.894278] [drm] kiq ring mec 3 pipe 1 q 0
[ 28.896557] [drm] VCN decode and encode initialized successfully(under DPG Mode).
[ 28.896700] amdgpu 0000:c1:00.0: [drm:jpeg_v4_0_hw_init [amdgpu]] JPEG decode initialized success
fully.
[ 28.897413] amdgpu 0000:c1:00.0: amdgpu: ring gfx_0.0.0 uses VM inv eng 0 on hub 0
[ 28.897416] amdgpu 0000:c1:00.0: amdgpu: ring comp_1.0.0 uses VM inv eng 1 on hub 0
[ 28.897417] amdgpu 0000:c1:00.0: amdgpu: ring comp_1.1.0 uses VM inv eng 4 on hub 0
[ 28.897418] amdgpu 0000:c1:00.0: amdgpu: ring comp_1.2.0 uses VM inv eng 6 on hub 0
[ 28.897419] amdgpu 0000:c1:00.0: amdgpu: ring comp_1.3.0 uses VM inv eng 7 on hub 0
[ 28.897421] amdgpu 0000:c1:00.0: amdgpu: ring comp_1.0.1 uses VM inv eng 8 on hub 0
[ 28.897422] amdgpu 0000:c1:00.0: amdgpu: ring comp_1.1.1 uses VM inv eng 9 on hub 0
[ 28.897423] amdgpu 0000:c1:00.0: amdgpu: ring comp_1.2.1 uses VM inv eng 10 on hub 0
[ 28.897424] amdgpu 0000:c1:00.0: amdgpu: ring comp_1.3.1 uses VM inv eng 11 on hub 0
[ 28.897425] amdgpu 0000:c1:00.0: amdgpu: ring sdma0 uses VM inv eng 12 on hub 0
[ 28.897427] amdgpu 0000:c1:00.0: amdgpu: ring vcn_unified_0 uses VM inv eng 0 on hub 8
[ 28.897428] amdgpu 0000:c1:00.0: amdgpu: ring jpeg_dec uses VM inv eng 1 on hub 8
[ 28.897429] amdgpu 0000:c1:00.0: amdgpu: ring mes_kiq_3.1.0 uses VM inv eng 13 on hub 0
[ 28.900151] amdgpu 0000:c1:00.0: amdgpu: recover vram bo from shadow start
[ 28.900152] amdgpu 0000:c1:00.0: amdgpu: recover vram bo from shadow done
[ 28.900196] [drm] Skip scheduling IBs!
[ 28.900205] [drm] Skip scheduling IBs!
[ 28.900210] [drm] Skip scheduling IBs!
[ 28.900214] [drm] Skip scheduling IBs!
[ 28.900218] [drm] Skip scheduling IBs!
[ 28.900222] [drm] Skip scheduling IBs!
[ 28.900227] [drm] Skip scheduling IBs!
[ 28.900231] [drm] Skip scheduling IBs!
[ 28.900235] [drm] Skip scheduling IBs!
[ 28.900249] [drm] Skip scheduling IBs!
[ 28.900253] [drm] Skip scheduling IBs!
[ 28.900256] [drm] Skip scheduling IBs!
[ 28.900260] [drm] Skip scheduling IBs!
[ 28.900263] [drm] Skip scheduling IBs!
[ 28.900269] [drm] Skip scheduling IBs!
[ 28.900273] [drm] Skip scheduling IBs!
[ 28.900276] [drm] Skip scheduling IBs!
[ 28.900277] [drm] Skip scheduling IBs!
[ 28.900283] [drm] Skip scheduling IBs!
[ 28.900286] [drm] Skip scheduling IBs!
[ 28.901248] [drm] ring gfx_32770.1.1 was added
[ 28.902122] [drm] ring compute_32770.2.2 was added
[ 28.902925] [drm] ring sdma_32770.3.3 was added
[ 28.902952] [drm] ring gfx_32770.1.1 ib test pass
[ 28.902979] [drm] ring compute_32770.2.2 ib test pass
[ 28.903155] [drm] ring sdma_32770.3.3 ib test pass
[ 28.904399] amdgpu 0000:c1:00.0: amdgpu: GPU reset(1) succeeded!
[ 29.176697] [drm] Skip scheduling IBs!
[ 29.182226] [drm] Skip scheduling IBs!
[ 29.182594] [drm] Skip scheduling IBs!
[ 29.182731] show_signal_msg: 59 callbacks suppressed
[ 29.182734] kwin_wayland[1675]: segfault at 0 ip 00007fc921dd6750 sp 00007ffe4557cd98 error 6 in libkwineffects.so.5.27.8[7fc921dc4000+29000] likely on CPU 4 (core 2, socket 0)
[ 29.182745] Code: d2 74 30 4c 8b 07 31 c0 4d 03 40 10 66 0f 1f 44 00 00 48 63 c8 48 89 c7 48 83 c0 01 48 c1 e1 04 48 c1 e7 04 f3 41 0f 6f 04 08 <0f> 11 04 3e 48 39 d0 75 df c3 66 0f 1f 44 00 00 f3 0f 1e fa 55 66
[ 29.571405] [drm] Skip scheduling IBs!
[ 29.571444] [drm] Skip scheduling IBs!
[ 29.571464] [drm] Skip scheduling IBs!
[ 29.571475] [drm] Skip scheduling IBs!
[ 29.571487] [drm] Skip scheduling IBs!
[ 29.571510] [drm] Skip scheduling IBs!
[ 29.571525] [drm] Skip scheduling IBs!
[ 29.571536] [drm] Skip scheduling IBs!
[ 29.571743] [drm] Skip scheduling IBs!
[ 29.571758] [drm] Skip scheduling IBs!
[ 29.571769] [drm] Skip scheduling IBs!
[ 29.571779] [drm] Skip scheduling IBs!
There is a segfault pointing at the kwin
compositor’s graphical effects, as loaded by the sddm
login manager - not sure if this is the cause of the GFX card reset or a symptom.
[ 29.182734] kwin_wayland[1675]: segfault at 0 ip 00007fc921dd6750 sp 00007ffe4557cd98 error 6 in libkwineffects.so.5.27.8[7fc921dc4000+29000] likely on CPU 4 (core 2, socket 0)
The suggestion by @Shibusuke of using the default Fedora ISO with GNOME, installing the KDE desktop, and using the GNOME login manager should be an effective workaround. There is still an underlying issue here though - (potentially) flawed user-mode software shouldn’t necessarily be able to cause a reset of the graphics card.