Video hardware acceleration crashes display in kernel version 6.10.3

I suggest you raise a bug report with your distro to get mesa updated.

For fedora I see this bug…
https://bugzilla.redhat.com/show_bug.cgi?id=2299241

Don’t know if we can make it to have higher priority. And correct Assignee.

thank @Mario_Limonciello :wink:
https://dl.fedoraproject.org/pub/fedora/linux/updates/40/Everything/x86_64/Packages/m/

mesa-dri-drivers-24.1.6-1.fc40.x86_64.rpm 2024-08-19 08:51 26M

Look to be on the way… may be only wait for mirror update now on fedora.

Thanks for the reply with the bug both on mesa and on redhat.
I don’t fully understand how the graphics driver communicate or why it also works with 6.9, but hopefully the next mesa fixes it because it’s very annoying. Even VLC crashes :frowning:
I guess I’ll stick to 6.9 until the next mesa release is available on stable
Edit: apparently it was approved yesterday, I will test it later today.

I got the mesa 24.1.6 update today and re-enabled hardware acceleration in firefox.
I just watched some youtube videos for an hour or so and hadn’t noticed any problems.
Seems this case is solved now.

1 Like

Thank you very much! I also updated MESA (still using 6.9 and I’ll test with 6.10.x now as well) :slight_smile:

I’ve had two crashes tonight while I was sleeping and the laptop sitting idle. Firefox with three windows open (my hotlist, vrm portal and this forum). After the crash, all programs were closed, GUI restart and sitting at the login prompt. System is openSUSE tumbleweed, uptime around 2 days.
mesa version is probably still too old, albeit tumbleweed otherwise being at the leading edge.

Mesa 24.1.3-1699.392
Kernel 6.10.5-1-default
Firefox 129.0 (64-Bit) openSUSE

[24. Aug 23:16] [  T88836] [drm:amdgpu_job_timedout [amdgpu]] *ERROR* ring gfx_0.0.0 timeout, signaled seq=40108971, emitted seq=40108973
[  +0,000178] [  T88836] [drm:amdgpu_job_timedout [amdgpu]] *ERROR* Process information: process firefox pid 5995 thread firefox:cs0 pid 6078
[  +0,000137] [  T88836] amdgpu 0000:c2:00.0: amdgpu: GPU reset begin!
[24. Aug 23:17] [  T88836] amdgpu 0000:c2:00.0: amdgpu: MES failed to respond to msg=REMOVE_QUEUE
[  +0,000007] [  T88836] [drm:amdgpu_mes_unmap_legacy_queue [amdgpu]] *ERROR* failed to unmap legacy queue
[  +0,206490] [  T88836] [drm:gfx_v11_0_hw_fini [amdgpu]] *ERROR* failed to halt cp gfx
[  +0,001675] [  T88836] amdgpu 0000:c2:00.0: amdgpu: MODE2 reset
[  +0,034447] [  T88836] amdgpu 0000:c2:00.0: amdgpu: GPU reset succeeded, trying to resume
[  +0,000633] [  T88836] [drm] PCIE GART of 512M enabled (table at 0x000000807FD00000).
[  +0,000206] [  T88836] [drm] VRAM is lost due to GPU reset!
[  +0,000003] [  T88836] amdgpu 0000:c2:00.0: amdgpu: SMU is resuming...
[  +0,001743] [  T88836] amdgpu 0000:c2:00.0: amdgpu: SMU is resumed successfully!
[  +0,002200] [  T88836] [drm] DMUB hardware initialized: version=0x08004000
[  +0,009451] [  T88836] [drm] kiq ring mec 3 pipe 1 q 0
[  +0,002419] [  T88836] amdgpu 0000:c2:00.0: [drm:jpeg_v4_0_hw_init [amdgpu]] JPEG decode initialized successfully.
[  +0,000694] [  T88836] amdgpu 0000:c2:00.0: amdgpu: ring gfx_0.0.0 uses VM inv eng 0 on hub 0
[  +0,000005] [  T88836] amdgpu 0000:c2:00.0: amdgpu: ring comp_1.0.0 uses VM inv eng 1 on hub 0
[  +0,000003] [  T88836] amdgpu 0000:c2:00.0: amdgpu: ring comp_1.1.0 uses VM inv eng 4 on hub 0
[  +0,000003] [  T88836] amdgpu 0000:c2:00.0: amdgpu: ring comp_1.2.0 uses VM inv eng 6 on hub 0
[  +0,000002] [  T88836] amdgpu 0000:c2:00.0: amdgpu: ring comp_1.3.0 uses VM inv eng 7 on hub 0
[  +0,000003] [  T88836] amdgpu 0000:c2:00.0: amdgpu: ring comp_1.0.1 uses VM inv eng 8 on hub 0
[  +0,000002] [  T88836] amdgpu 0000:c2:00.0: amdgpu: ring comp_1.1.1 uses VM inv eng 9 on hub 0
[  +0,000002] [  T88836] amdgpu 0000:c2:00.0: amdgpu: ring comp_1.2.1 uses VM inv eng 10 on hub 0
[  +0,000003] [  T88836] amdgpu 0000:c2:00.0: amdgpu: ring comp_1.3.1 uses VM inv eng 11 on hub 0
[  +0,000002] [  T88836] amdgpu 0000:c2:00.0: amdgpu: ring sdma0 uses VM inv eng 12 on hub 0
[  +0,000003] [  T88836] amdgpu 0000:c2:00.0: amdgpu: ring vcn_unified_0 uses VM inv eng 0 on hub 8
[  +0,000003] [  T88836] amdgpu 0000:c2:00.0: amdgpu: ring jpeg_dec uses VM inv eng 1 on hub 8
[  +0,000002] [  T88836] amdgpu 0000:c2:00.0: amdgpu: ring mes_kiq_3.1.0 uses VM inv eng 13 on hub 0
[  +0,001891] [  T88836] amdgpu 0000:c2:00.0: amdgpu: recover vram bo from shadow start
[  +0,000002] [  T88836] amdgpu 0000:c2:00.0: amdgpu: recover vram bo from shadow done
[  +0,000018] [  T88836] amdgpu 0000:c2:00.0: amdgpu: GPU reset(2) succeeded!
[  +0,960163] [   T1972] [drm] Send DSC enable to synaptics

[24. Aug 23:27] [  T89873] ------------[ cut here ]------------
[  +0,000007] [  T89873] refcount_t: underflow; use-after-free.
[  +0,000009] [  T89873] WARNING: CPU: 2 PID: 89873 at lib/refcount.c:28 refcount_warn_saturate+0xbe/0x110
[  +0,000012] [  T89873] Modules linked in: ccm rfcomm snd_seq_dummy snd_hrtimer snd_seq snd_seq_device af_packet nft_fib_inet nft_fib_ipv4 nft_fib_ipv6 nft_fib nft_reject_inet nf>
[  +0,000097] [  T89873]  snd_hda_scodec_component amd_atl intel_rapl_msr snd_soc_core snd_hda_codec_hdmi intel_rapl_common btusb uvcvideo btrtl edac_mce_amd snd_compress btintel >
[  +0,000114] [  T89873]  scsi_dh_alua sg scsi_mod scsi_common typec_displayport usbhid amdgpu amdxcp i2c_algo_bit drm_ttm_helper ttm drm_exec crct10dif_pclmul crc32_pclmul gpu_sc>
[  +0,000075] [  T89873] CPU: 2 PID: 89873 Comm: Xorg.bin Not tainted 6.10.5-1-default #1 openSUSE Tumbleweed 69f48d427608e1c09e60ea24c6c55e2ca1b049e8
[  +0,000005] [  T89873] Hardware name: Framework Laptop 16 (AMD Ryzen 7040 Series)/FRANMZCP07, BIOS 03.04 07/09/2024
[  +0,000003] [  T89873] RIP: 0010:refcount_warn_saturate+0xbe/0x110
[  +0,000005] [  T89873] Code: 01 01 e8 55 4d a7 ff 0f 0b e9 29 5f 8b 00 80 3d 60 ac a5 01 00 75 85 48 c7 c7 68 c9 69 89 c6 05 50 ac a5 01 01 e8 32 4d a7 ff <0f> 0b e9 06 5f 8b 00>
[  +0,000002] [  T89873] RSP: 0018:ffffb3bd23627a50 EFLAGS: 00010286
[  +0,000004] [  T89873] RAX: 0000000000000000 RBX: 0000000000000001 RCX: 0000000000000027
[  +0,000002] [  T89873] RDX: ffff88975fb27808 RSI: 0000000000000001 RDI: ffff88975fb27800
[  +0,000002] [  T89873] RBP: ffff888a310c01a0 R08: 0000000000000000 R09: ffffb3bd23627900
[  +0,000002] [  T89873] R10: ffffb3bd236278f8 R11: 0000000000000003 R12: ffff888a310c0000
[  +0,000001] [  T89873] R13: 0000000000000003 R14: 0000000000000004 R15: 000000000000003c
[  +0,000002] [  T89873] FS:  00007effca009980(0000) GS:ffff88975fb00000(0000) knlGS:0000000000000000
[  +0,000003] [  T89873] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[  +0,000002] [  T89873] CR2: 00007f4d2f4d2e40 CR3: 0000000284d52000 CR4: 0000000000750ef0
[  +0,000003] [  T89873] PKRU: 55555554
[  +0,000001] [  T89873] Call Trace:
[  +0,000003] [  T89873]  <TASK>
[  +0,000003] [  T89873]  ? refcount_warn_saturate+0xbe/0x110
[  +0,000004] [  T89873]  ? __warn.cold+0xa8/0x102
[  +0,000007] [  T89873]  ? refcount_warn_saturate+0xbe/0x110
[  +0,000010] [  T89873]  ? report_bug+0xd8/0x150
[  +0,000006] [  T89873]  ? handle_bug+0x3c/0x80
[  +0,000006] [  T89873]  ? exc_invalid_op+0x17/0x70
[  +0,000004] [  T89873]  ? asm_exc_invalid_op+0x1a/0x20
[  +0,000009] [  T89873]  ? refcount_warn_saturate+0xbe/0x110
[  +0,000006] [  T89873]  dc_state_destruct+0x51/0x250 [amdgpu e27df8c7e1e85939e5f4b2290a27f9148f18165e]
[  +0,000397] [  T89873]  dc_state_release+0x42/0x90 [amdgpu e27df8c7e1e85939e5f4b2290a27f9148f18165e]
[  +0,000302] [  T89873]  dm_atomic_destroy_state+0x20/0x30 [amdgpu e27df8c7e1e85939e5f4b2290a27f9148f18165e]
[  +0,000347] [  T89873]  drm_atomic_state_default_clear+0x225/0x2e0
[  +0,000008] [  T89873]  __drm_atomic_state_free+0x71/0xb0
[  +0,000004] [  T89873]  drm_mode_obj_set_property_ioctl+0x35f/0x3b0
[  +0,000008] [  T89873]  ? __pfx_drm_connector_property_set_ioctl+0x10/0x10
[  +0,000006] [  T89873]  drm_connector_property_set_ioctl+0x3b/0x60
[  +0,000005] [  T89873]  drm_ioctl_kernel+0xaa/0x100
[  +0,000005] [  T89873]  drm_ioctl+0x25d/0x4c0
[  +0,000003] [  T89873]  ? __pfx_drm_connector_property_set_ioctl+0x10/0x10
[  +0,000005] [  T89873]  ? drm_connector_property_set_ioctl+0x3b/0x60
[  +0,000004] [  T89873]  ? srso_alias_return_thunk+0x5/0xfbef5
[  +0,000004] [  T89873]  ? drm_ioctl_kernel+0xaa/0x100
[  +0,000003] [  T89873]  ? srso_alias_return_thunk+0x5/0xfbef5
[  +0,000003] [  T89873]  ? __check_object_size+0x50/0x220
[  +0,000009] [  T89873]  amdgpu_drm_ioctl+0x4e/0x90 [amdgpu e27df8c7e1e85939e5f4b2290a27f9148f18165e]
[  +0,000272] [  T89873]  __x64_sys_ioctl+0x94/0xd0
[  +0,000008] [  T89873]  do_syscall_64+0x82/0x160
[  +0,000005] [  T89873]  ? srso_alias_return_thunk+0x5/0xfbef5
[  +0,000002] [  T89873]  ? __pm_runtime_suspend+0x6f/0xd0
[  +0,000005] [  T89873]  ? srso_alias_return_thunk+0x5/0xfbef5
[  +0,000002] [  T89873]  ? amdgpu_drm_ioctl+0x71/0x90 [amdgpu e27df8c7e1e85939e5f4b2290a27f9148f18165e]
[  +0,000120] [  T89873]  ? srso_alias_return_thunk+0x5/0xfbef5
[  +0,000002] [  T89873]  ? switch_fpu_return+0x4f/0xd0
[  +0,000005] [  T89873]  ? srso_alias_return_thunk+0x5/0xfbef5
[  +0,000001] [  T89873]  ? syscall_exit_to_user_mode+0x72/0x220
[  +0,000003] [  T89873]  ? srso_alias_return_thunk+0x5/0xfbef5
[  +0,000001] [  T89873]  ? do_syscall_64+0x8e/0x160
[  +0,000002] [  T89873]  ? srso_alias_return_thunk+0x5/0xfbef5
[  +0,000001] [  T89873]  ? cgroup_rstat_updated+0x69/0x220
[  +0,000005] [  T89873]  ? srso_alias_return_thunk+0x5/0xfbef5
[  +0,000001] [  T89873]  ? __count_memcg_events+0x58/0xf0
[  +0,000004] [  T89873]  ? srso_alias_return_thunk+0x5/0xfbef5
[  +0,000001] [  T89873]  ? count_memcg_events.constprop.0+0x1a/0x30
[  +0,000004] [  T89873]  ? srso_alias_return_thunk+0x5/0xfbef5
[  +0,000001] [  T89873]  ? handle_mm_fault+0xae/0x320
[  +0,000003] [  T89873]  ? srso_alias_return_thunk+0x5/0xfbef5
[  +0,000002] [  T89873]  ? srso_alias_return_thunk+0x5/0xfbef5
[  +0,000002] [  T89873]  entry_SYSCALL_64_after_hwframe+0x76/0x7e
[  +0,000002] [  T89873] RIP: 0033:0x7effc9f0f70f
[  +0,000062] [  T89873] Code: 00 48 89 44 24 18 31 c0 48 8d 44 24 60 c7 04 24 10 00 00 00 48 89 44 24 08 48 8d 44 24 20 48 89 44 24 10 b8 10 00 00 00 0f 05 <89> c2 3d 00 f0 ff ff>
[  +0,000001] [  T89873] RSP: 002b:00007ffc89d30190 EFLAGS: 00000246 ORIG_RAX: 0000000000000010
[  +0,000002] [  T89873] RAX: ffffffffffffffda RBX: 0000000000000003 RCX: 00007effc9f0f70f
[  +0,000000] [  T89873] RDX: 00007ffc89d30220 RSI: 00000000c01064ab RDI: 000000000000000e
[  +0,000001] [  T89873] RBP: 00007ffc89d30220 R08: 0000000000000002 R09: 0000000000000000
[  +0,000001] [  T89873] R10: 0000000000000001 R11: 0000000000000246 R12: 00000000c01064ab
[  +0,000001] [  T89873] R13: 000000000000000e R14: 0000000000000000 R15: 00005569b0723d60
[  +0,000003] [  T89873]  </TASK>
[  +0,000001] [  T89873] ---[ end trace 0000000000000000 ]---
[  +0,827147] [  T89873] [drm] Send DSC enable to synaptics
[  +0,616137] [  T89873] [drm] Send DSC disable to synaptics


[25. Aug 07:03] [ T162715] [drm:amdgpu_job_timedout [amdgpu]] *ERROR* ring gfx_0.0.0 timeout, signaled seq=56809728, emitted seq=56809730
[  +0,000175] [ T162715] [drm:amdgpu_job_timedout [amdgpu]] *ERROR* Process information: process firefox pid 106082 thread firefox:cs0 pid 106168
[  +0,000139] [ T162715] amdgpu 0000:c2:00.0: amdgpu: GPU reset begin!
[  +2,008376] [ T162715] amdgpu 0000:c2:00.0: amdgpu: MES failed to respond to msg=REMOVE_QUEUE
[  +0,000006] [ T162715] [drm:amdgpu_mes_unmap_legacy_queue [amdgpu]] *ERROR* failed to unmap legacy queue
[  +0,205651] [ T162715] [drm:gfx_v11_0_hw_fini [amdgpu]] *ERROR* failed to halt cp gfx
[  +0,001518] [ T162715] amdgpu 0000:c2:00.0: amdgpu: MODE2 reset
[  +0,033866] [ T162715] amdgpu 0000:c2:00.0: amdgpu: GPU reset succeeded, trying to resume
[  +0,000693] [ T162715] [drm] PCIE GART of 512M enabled (table at 0x000000807FD00000).
[  +0,000252] [ T162715] [drm] VRAM is lost due to GPU reset!
[  +0,000006] [ T162715] amdgpu 0000:c2:00.0: amdgpu: SMU is resuming...
[  +0,002072] [ T162715] amdgpu 0000:c2:00.0: amdgpu: SMU is resumed successfully!
[  +0,002181] [ T162715] [drm] DMUB hardware initialized: version=0x08004000
[  +0,007275] [ T162715] [drm] kiq ring mec 3 pipe 1 q 0
[  +0,001930] [ T162715] amdgpu 0000:c2:00.0: [drm:jpeg_v4_0_hw_init [amdgpu]] JPEG decode initialized successfully.
[  +0,000478] [ T162715] amdgpu 0000:c2:00.0: amdgpu: ring gfx_0.0.0 uses VM inv eng 0 on hub 0
[  +0,000002] [ T162715] amdgpu 0000:c2:00.0: amdgpu: ring comp_1.0.0 uses VM inv eng 1 on hub 0
[  +0,000002] [ T162715] amdgpu 0000:c2:00.0: amdgpu: ring comp_1.1.0 uses VM inv eng 4 on hub 0
[  +0,000001] [ T162715] amdgpu 0000:c2:00.0: amdgpu: ring comp_1.2.0 uses VM inv eng 6 on hub 0
[  +0,000001] [ T162715] amdgpu 0000:c2:00.0: amdgpu: ring comp_1.3.0 uses VM inv eng 7 on hub 0
[  +0,000001] [ T162715] amdgpu 0000:c2:00.0: amdgpu: ring comp_1.0.1 uses VM inv eng 8 on hub 0
[  +0,000001] [ T162715] amdgpu 0000:c2:00.0: amdgpu: ring comp_1.1.1 uses VM inv eng 9 on hub 0
[  +0,000002] [ T162715] amdgpu 0000:c2:00.0: amdgpu: ring comp_1.2.1 uses VM inv eng 10 on hub 0
[  +0,000001] [ T162715] amdgpu 0000:c2:00.0: amdgpu: ring comp_1.3.1 uses VM inv eng 11 on hub 0
[  +0,000001] [ T162715] amdgpu 0000:c2:00.0: amdgpu: ring sdma0 uses VM inv eng 12 on hub 0
[  +0,000001] [ T162715] amdgpu 0000:c2:00.0: amdgpu: ring vcn_unified_0 uses VM inv eng 0 on hub 8
[  +0,000001] [ T162715] amdgpu 0000:c2:00.0: amdgpu: ring jpeg_dec uses VM inv eng 1 on hub 8
[  +0,000001] [ T162715] amdgpu 0000:c2:00.0: amdgpu: ring mes_kiq_3.1.0 uses VM inv eng 13 on hub 0
[  +0,001763] [ T162715] amdgpu 0000:c2:00.0: amdgpu: recover vram bo from shadow start
[  +0,000002] [ T162715] amdgpu 0000:c2:00.0: amdgpu: recover vram bo from shadow done
[  +0,000014] [ T162715] amdgpu 0000:c2:00.0: amdgpu: GPU reset(4) succeeded!
[  +0,971300] [  T89874] [drm] Send DSC enable to synaptics

Yesh your Mesa is still “too old”, you will need .6 version if i remember right to get the needed fix.

One thing may have broke for me after the update.
I can’t use UltiMaker Cura because I supposedly don’t have OpenGL 2.0 or newer anymore.

In Info Center there’s OpenGL (EGL) and OpenGL (GLX).
EGL says API version: 1.5 and OpenGL core profile version: 4.6,
while GLX says GLX version: 1.4 and Max core profile version: 4.6.

I would presume that it’s the core profile version that matters, as the version number matches the latest OpenGL release, but then why isn’t it working?