Arch linux random crashes

Which Linux distro are you using?
arch

last date updated?
a few days ago, logs are from different days

Which kernel are you using?
6.10

Which BIOS version are you using?
3.05

Which Framework Laptop 13 model are you using?
AMD Ryzenβ„’ 7040 Series

im having random crashes every few hours to few days. nomodeset seemed like it made it better but i think i still got a kernel panic, though it was a little while ago so i don’t remember.
im using kde plasma.

here are some of the crash logs

What software are you using for handling wireless connections? I just had a quick look at your first log and your NetworkManager died. There is a lot of logging for you Wifi connections before that which I don’t see at my Arch Linux setup. I’m not sure if this is related to your system crash but it may be an issue.

 9月 25 23:22:32 hostname python3[1489]: current ssid: T Bone 2 6GHz
 9月 25 23:22:32 hostname python3[1489]: Trying to set priority for /org/freedesktop/NetworkManager/Settings/1 to 22
 9月 25 23:22:32 hostname python3[1489]: Priority set for /org/freedesktop/NetworkManager/Settings/1 to 22
 9月 25 23:22:32 hostname python3[1489]: All connections processed.
 9月 25 23:22:32 hostname python3[1489]: best ssid T Bone 2 6GHz is current ssid T Bone 2 6GHz not changing anything
 9月 25 23:22:32 hostname python3[1489]: starting
 9月 25 23:22:32 hostname python3[1489]: 1727331747 1727331712.485673
 9月 25 23:22:32 hostname python3[1489]: Rescanning for Wi-Fi networks...
 9月 25 23:22:32 hostname python3[1489]: getting known networks
 9月 25 23:22:32 hostname python3[1489]: getting avialable netowkr
 9月 25 23:22:32 hostname python3[1489]: get current conn
 9月 25 23:22:32 hostname python3[1489]: T Bone 2 6GHz:
 9月 25 23:22:32 hostname python3[1489]:     frequency: 7015
 9月 25 23:22:32 hostname python3[1489]:     strength:  63
 9月 25 23:22:32 hostname python3[1489]:     is_known:  True
 9月 25 23:22:32 hostname python3[1489]: T Bone 2 5GHz:
 9月 25 23:22:32 hostname python3[1489]:     frequency: 2417
 9月 25 23:22:32 hostname python3[1489]:     strength:  79
 9月 25 23:22:32 hostname python3[1489]:     is_known:  False
 9月 25 23:22:32 hostname python3[1489]: T Bone 2 IoT 5GHz:
 9月 25 23:22:32 hostname python3[1489]:     frequency: 2417
 9月 25 23:22:32 hostname python3[1489]:     strength:  79
 9月 25 23:22:32 hostname python3[1489]:     is_known:  False
 9月 25 23:22:32 hostname python3[1489]: Super Fly For A Wifi 5G-EX:
 9月 25 23:22:32 hostname python3[1489]:     frequency: 2417
 9月 25 23:22:32 hostname python3[1489]:     strength:  42
 9月 25 23:22:32 hostname python3[1489]:     is_known:  False
 9月 25 23:22:32 hostname python3[1489]: T Bone 2 IoT Local 2.4GHz:
 9月 25 23:22:32 hostname python3[1489]:     frequency: 2417
 9月 25 23:22:32 hostname python3[1489]:     strength:  97
 9月 25 23:22:32 hostname python3[1489]:     is_known:  False
 9月 25 23:22:32 hostname python3[1489]: T Bone 2 2.4GHz:
 9月 25 23:22:32 hostname python3[1489]:     frequency: 2417
 9月 25 23:22:32 hostname python3[1489]:     strength:  97
 9月 25 23:22:32 hostname python3[1489]:     is_known:  False
 9月 25 23:22:32 hostname python3[1489]: T Bone 2 IoT 2.4GHz:
 9月 25 23:22:32 hostname python3[1489]:     frequency: 2417
 9月 25 23:22:32 hostname python3[1489]:     strength:  97
 9月 25 23:22:32 hostname python3[1489]:     is_known:  False
 9月 25 23:22:32 hostname python3[1489]: T Bone 2:
 9月 25 23:22:32 hostname python3[1489]:     frequency: 2417
 9月 25 23:22:32 hostname python3[1489]:     strength:  79
 9月 25 23:22:32 hostname python3[1489]:     is_known:  False
 9月 25 23:22:32 hostname python3[1489]: CAEDH:
 9月 25 23:22:32 hostname python3[1489]:     frequency: 2417
 9月 25 23:22:32 hostname python3[1489]:     strength:  62
 9月 25 23:22:32 hostname python3[1489]:     is_known:  False
 9月 25 23:22:32 hostname python3[1489]: wifi6e-control:
 9月 25 23:22:32 hostname kded6[3677]: kf.networkmanagerqt: void NetworkManager::ConnectionPrivate::onPropertiesChanged(const QVariantMap&) Unhandled property "VersionId"

[…]

 9月 25 23:25:27 hostname systemd-coredump[15987]: Process 1322 (NetworkManager) of user 0 terminated abnormally with signal 11/SEGV, processing...
 9月 25 23:25:27 hostname systemd[1]: Created slice Slice /system/drkonqi-coredump-processor.
 9月 25 23:25:27 hostname systemd[1]: Created slice Slice /system/systemd-coredump.
 9月 25 23:25:27 hostname systemd[1]: Started Process Core Dump (PID 15987/UID 0).
 9月 25 23:25:27 hostname systemd[1]: Started Pass systemd-coredump journal entries to relevant user for potential DrKonqi handling.
 9月 25 23:25:28 hostname systemd-coredump[15988]: Process 1322 (NetworkManager) of user 0 dumped core.
                                                                           
                                                                           Stack trace of thread 1322:
                                                                           #0  0x0000609d36f4ecc4 n/a (NetworkManager + 0x8dcc4)
                                                                           #1  0x0000609d36f879de n/a (NetworkManager + 0xc69de)
                                                                           #2  0x0000609d36f87c6a n/a (NetworkManager + 0xc6c6a)
                                                                           #3  0x0000609d36fbbd11 n/a (NetworkManager + 0xfad11)
                                                                           #4  0x0000609d3706b0f2 n/a (NetworkManager + 0x1aa0f2)
                                                                           #5  0x00007283066c0e52 n/a (libgio-2.0.so.0 + 0x107e52)
                                                                           #6  0x00007283064c6559 n/a (libglib-2.0.so.0 + 0x5d559)
                                                                           #7  0x0000728306529157 n/a (libglib-2.0.so.0 + 0xc0157)
                                                                           #8  0x00007283064c7287 g_main_loop_run (libglib-2.0.so.0 + 0x5e287)
                                                                           #9  0x0000609d36eeedab n/a (NetworkManager + 0x2ddab)
                                                                           #10 0x0000728305e10e08 n/a (libc.so.6 + 0x25e08)
                                                                           #11 0x0000728305e10ecc __libc_start_main (libc.so.6 + 0x25ecc)
                                                                           #12 0x0000609d36eef5c5 n/a (NetworkManager + 0x2e5c5)
                                                                           
                                                                           Stack trace of thread 1336:
                                                                           #0  0x0000728305f021fd syscall (libc.so.6 + 0x1171fd)
                                                                           #1  0x00007283064f7f20 g_cond_wait (libglib-2.0.so.0 + 0x8ef20)
                                                                           #2  0x000072830648e98c n/a (libglib-2.0.so.0 + 0x2598c)
                                                                           #3  0x00007283064fe137 n/a (libglib-2.0.so.0 + 0x95137)
                                                                           #4  0x00007283064fa026 n/a (libglib-2.0.so.0 + 0x91026)
                                                                           #5  0x0000728305e7f39d n/a (libc.so.6 + 0x9439d)
                                                                           #6  0x0000728305f0449c n/a (libc.so.6 + 0x11949c)
                                                                           
                                                                           Stack trace of thread 1335:
                                                                           #0  0x0000728305ef6bb0 ppoll (libc.so.6 + 0x10bbb0)
                                                                           #1  0x0000728306529227 n/a (libglib-2.0.so.0 + 0xc0227)
                                                                           #2  0x00007283064c5a55 g_main_context_iteration (libglib-2.0.so.0 + 0x5ca55)
                                                                           #3  0x00007283064c5ab2 n/a (libglib-2.0.so.0 + 0x5cab2)
                                                                           #4  0x00007283064fa026 n/a (libglib-2.0.so.0 + 0x91026)
                                                                           #5  0x0000728305e7f39d n/a (libc.so.6 + 0x9439d)
                                                                           #6  0x0000728305f0449c n/a (libc.so.6 + 0x11949c)
                                                                           
                                                                           Stack trace of thread 1337:
                                                                           #0  0x0000728305ef6bb0 ppoll (libc.so.6 + 0x10bbb0)
                                                                           #1  0x0000728306529227 n/a (libglib-2.0.so.0 + 0xc0227)
                                                                           #2  0x00007283064c7287 g_main_loop_run (libglib-2.0.so.0 + 0x5e287)
                                                                           #3  0x00007283066ccb44 n/a (libgio-2.0.so.0 + 0x113b44)
                                                                           #4  0x00007283064fa026 n/a (libglib-2.0.so.0 + 0x91026)
                                                                           #5  0x0000728305e7f39d n/a (libc.so.6 + 0x9439d)
                                                                           #6  0x0000728305f0449c n/a (libc.so.6 + 0x11949c)
                                                                           ELF object binary architecture: AMD x86-64
 9月 25 23:25:28 hostname systemd[1]: systemd-coredump@0-15987-0.service: Deactivated successfully.
 9月 25 23:25:28 hostname kernel: BUG: unable to handle page fault for address: ffffd1f6c5000008

While there are loads of logs for network connections in there, which certainly is odd, I think this might be more important:

BUG: kernel NULL pointer dereference, address: 0000000000000084

Which is followed by this stack trace:

 9月 25 23:35:02 hostname kernel: Call Trace:
 9月 25 23:35:02 hostname kernel:  <TASK>
 9月 25 23:35:02 hostname kernel:  ? __die_body.cold+0x19/0x27
 9月 25 23:35:02 hostname kernel:  ? page_fault_oops+0x15a/0x2d0
 9月 25 23:35:02 hostname kernel:  ? exc_page_fault+0x81/0x190
 9月 25 23:35:02 hostname kernel:  ? asm_exc_page_fault+0x26/0x30
 9月 25 23:35:02 hostname kernel:  ? _raw_spin_lock+0x17/0x30
 9月 25 23:35:02 hostname kernel:  ? srso_alias_return_thunk+0x5/0xfbef5
 9月 25 23:35:02 hostname kernel:  detach_extent_buffer_folio+0x70/0x140 [btrfs c0b45e615b380b68109f2d19d5a8d2d579129df0]
 9月 25 23:35:02 hostname kernel:  btrfs_release_extent_buffer_pages+0x5d/0x80 [btrfs c0b45e615b380b68109f2d19d5a8d2d579129df0]
 9月 25 23:35:02 hostname kernel:  release_extent_buffer+0x37/0xb0 [btrfs c0b45e615b380b68109f2d19d5a8d2d579129df0]
 9月 25 23:35:02 hostname kernel:  btree_release_folio+0x26/0x40 [btrfs c0b45e615b380b68109f2d19d5a8d2d579129df0]
 9月 25 23:35:02 hostname kernel:  shrink_folio_list+0x6cd/0xd60
 9月 25 23:35:02 hostname kernel:  evict_folios+0x275/0x630
 9月 25 23:35:02 hostname kernel:  try_to_shrink_lruvec+0x228/0x2b0
 9月 25 23:35:02 hostname kernel:  shrink_one+0x105/0x1f0
 9月 25 23:35:02 hostname kernel:  shrink_node+0x962/0xb70
 9月 25 23:35:02 hostname kernel:  balance_pgdat+0x580/0xa00
 9月 25 23:35:02 hostname kernel:  ? srso_alias_return_thunk+0x5/0xfbef5
 9月 25 23:35:02 hostname kernel:  ? psi_task_switch+0xd6/0x230
 9月 25 23:35:02 hostname kernel:  ? srso_alias_return_thunk+0x5/0xfbef5
 9月 25 23:35:02 hostname kernel:  ? finish_task_switch.isra.0+0x99/0x2e0
 9月 25 23:35:02 hostname kernel:  kswapd+0x1f5/0x380
 9月 25 23:35:02 hostname kernel:  ? __pfx_autoremove_wake_function+0x10/0x10
 9月 25 23:35:02 hostname kernel:  ? __pfx_kswapd+0x10/0x10
 9月 25 23:35:02 hostname kernel:  kthread+0xcf/0x100
 9月 25 23:35:02 hostname kernel:  ? __pfx_kthread+0x10/0x10
 9月 25 23:35:02 hostname kernel:  ret_from_fork+0x31/0x50
 9月 25 23:35:02 hostname kernel:  ? __pfx_kthread+0x10/0x10
 9月 25 23:35:02 hostname kernel:  ret_from_fork_asm+0x1a/0x30
 9月 25 23:35:02 hostname kernel:  </TASK>

Which the lines of the stacktrace mention btrfs and balance_pgdat.

I see this bug report for a null pointer dereference in btrfs from September 27th which mentions calling various balance functions in the path of the error: btrfs: fix a NULL pointer dereference when failed to start a new trasacntion - Patchwork

But I am certainly not an expert on the Linux kernel, I’m just doing some keyword matching, so I could be wrong :person_shrugging:

1 Like

im using some custom software controlling NetworkManager i wrote myself, that’s what those logs are.

im using custom software controlling NetworkManager that’s what all the logs are from, probably should have mentioned that in the original post, i forgot it logs so much

Try this patch. It helps a similar reported crash in btrfs due to a cleanup issue in amd-sfh.

[PATCH] HID: amd_sfh: Return immediately if no sensor is found - Basavaraj Natikar (kernel.org)

1 Like

btw some of the logs are capped to only the last few thousand lines with tail.

i tried to edit the original post to put that as an edit but it said β€˜You must choose at least 2 tags’ and it won’t let me pick any more tags

Actually here’s the version headed to Linus soon. Use this one.

[PATCH] HID: amd_sfh: Switch to device-managed dmam_alloc_coherent() - Basavaraj Natikar (kernel.org)

3 Likes