Linux kernel crashes on Gen 11

Which Linux distro are you using?
Manjaro (also tried on NixOS 22.11 and 23.05, and Fedora)

Which release version?
rolling…updated every week or so

Which kernel are you using?
up to 6.11

Which BIOS version are you using?
up to 3.20

Which Framework Laptop 13 model are you using?
11th Gen Intel® Core™

Hi all,

I have owned an 11th Gen Framework 13 since mid-2022.
It has an i7-1165G7 and 2x16GB RAM that I ordered from Framework with the laptop.
As for modules, I usually only have USB-A or USB-C ports installed.

From day one, I have been plagued by recurring kernel crashes, meaning the laptop freezes with a blinking CapsLock light. After that, it usually shuts off after a couple of seconds, briefly turns back on, and then shuts right off again. This happens intermittently: sometimes after only an hour of runtime, while at other times it runs for weeks without any problems. The freeze happens under different workloads: playing games, watching YouTube, or just coding in vim.

I have run memtest a couple of times, and recently even tried the DIMMs individually on either memory channel. The problem persisted, so the memory seems to be OK.

Over time, I have used Fedora, NixOS, and Manjaro, all of which had the same problem. Currently I’m on Manjaro with kernel 6.10. I tried several different kernel parameters, especially ones concerning the iGPU (enable_psr, enable_guc, etc.), to no avail.

I haven’t gotten around to configuring kexec for a proper kernel crash dump, but at least the logs (including dmesg) didn’t show anything suspicious at the time of the crash. However, they might just be incomplete.

Currently, I’m on firmware 3.20, but over time I have tried every release since the original firmware, and none of them fixed the problem.

Also, I replaced the original NVMe drive, a Samsung 970 Evo Plus 1TB, with a WD Black SN770 2TB, which didn’t change anything.

So by now, I am really out of ideas on what to try next. If you need any more info, please let me know and I will provide it.
I know that there are several other topics describing similar problems, but they are either rather old or the solution doesn’t work for me.

Thank you in advance for your support! I am glad for any hints.

Configure kexec and get a proper crash dump, or configure pstore and get a panic trace. Without one of those it’s impossible to even guess which driver is crashing, much less fix anything.
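
As a rough starting point, something like the Python sketch below could tell you whether either mechanism is already usable before the next freeze. It is not a full kdump or pstore setup, just a check: it looks for a `crashkernel=` reservation in `/proc/cmdline` (which a kexec-based crash dump needs) and lists any records left in `/sys/fs/pstore` from an earlier panic. The function names are made up for illustration, and the second directory, `/var/lib/systemd/pstore`, is an assumption that only applies if your distro runs systemd-pstore, which archives records there on boot. Reading pstore may need root.

```python
#!/usr/bin/env python3
"""Check crash-capture prerequisites: crashkernel= reservation and pstore records."""
from pathlib import Path


def crashkernel_setting(cmdline):
    """Return the value of crashkernel= from a kernel command line, or None."""
    for token in cmdline.split():
        if token.startswith("crashkernel="):
            return token.split("=", 1)[1]
    return None


def pstore_records(pstore_dir):
    """Return sorted file names in pstore_dir (empty if missing or unreadable)."""
    try:
        return sorted(p.name for p in Path(pstore_dir).iterdir() if p.is_file())
    except OSError:
        # Directory absent, or not readable without root.
        return []


def main():
    cmdline_path = Path("/proc/cmdline")
    cmdline = cmdline_path.read_text() if cmdline_path.exists() else ""
    reservation = crashkernel_setting(cmdline)
    if reservation is None:
        print("No crashkernel= on the kernel command line; kdump cannot capture a dump.")
    else:
        print(f"crashkernel={reservation} is set.")

    # The second path is an assumption: only relevant where systemd-pstore is in use.
    for directory in ("/sys/fs/pstore", "/var/lib/systemd/pstore"):
        records = pstore_records(directory)
        if records:
            print(f"{directory}: {len(records)} record(s): {', '.join(records)}")
        else:
            print(f"{directory}: no records found (or not readable).")


if __name__ == "__main__":
    main()
```

If it reports pstore records after the next crash, those files should contain the tail of the kernel log from the panic, which is the trace worth posting here.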