Resolving PCIe storage instability using Linux kernel flags

I recently got a Framework Desktop motherboard and was excited to upgrade, but I ran into some issues with my ~6-disk storage array and PCIe SATA controller.

When running several heavy parallel read/write workloads that strongly stressed the disks and controller, I would get this dmesg line:

ahci 0000:c1:00.0: Using 64-bit DMA addresses

Followed immediately by corrupted sector reads, seemingly randomly distributed across every disk in the array. I tried a Marvell 88SE9215 controller and an ASM1166 controller without any improvement. From what I've come to understand, this is the SATA controller card resetting itself mid-operation.
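If you want something to hammer on while watching the log, here is a rough sketch of the shape of the workload (parallel writers, then parallel readers). The paths, job counts, and sizes are my own placeholders, not the exact workload I ran, and temp files alone won't necessarily reproduce it — the real trigger was sustained I/O against the array itself:

```shell
#!/bin/sh
# Sketch only: parallel writers followed by parallel readers.
# Point the files at the array's filesystem to actually stress the controller.
set -e
for i in 1 2 3 4; do
  dd if=/dev/zero of=/tmp/stress_$i.bin bs=1M count=32 status=none &
done
wait
for i in 1 2 3 4; do
  dd if=/tmp/stress_$i.bin of=/dev/null bs=1M status=none &
done
wait
rm -f /tmp/stress_?.bin
echo "stress pass complete"
# Afterwards, check the kernel log for AHCI/ATA trouble (may need root):
#   dmesg | grep -Ei 'ahci|ata[0-9]+:|i/o error'
```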

I spent quite a bit of time chasing this as a physical issue (new card, cable replacements, power supply check, etc.) before settling on it being a Framework Desktop issue that had nothing to do with the other hardware.

By adding this to the kernel command line in /etc/default/grub I could stop the card from resetting and the errors from occurring, although it does break suspend:

amd_iommu=off iommu=soft pci=nomsi pci=noaer libata.force=noncq ahci.mobile_lpm_policy=0 libata.noacpi=1 pcie_aspm=off

This is a shotgun of flags gathered from various forum posts and some LLM usage. After a lot of trial and error, I have found that

amd_iommu=off pci=noaer pcie_aspm=off

Resolves the issue and maintains suspend. Edit it into your /etc/default/grub, run sudo grub2-mkconfig -o /boot/grub2/grub.cfg, then reboot, and you should be good to go until Framework releases a firmware fix for this issue.
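For concreteness, the flags belong on the GRUB_CMDLINE_LINUX line (some distros use GRUB_CMDLINE_LINUX_DEFAULT instead); append them to whatever is already there rather than replacing the line. A sketch of the edit:

```shell
# /etc/default/grub -- keep your existing flags and add these three
# (line name varies by distro: GRUB_CMDLINE_LINUX or GRUB_CMDLINE_LINUX_DEFAULT)
GRUB_CMDLINE_LINUX="amd_iommu=off pci=noaer pcie_aspm=off"
```

After regenerating the config and rebooting, `cat /proc/cmdline` will show whether the flags actually made it onto the running kernel's command line.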

I hope this is helpful to those in a similar boat and saves you all some debugging.

Hi @DebugDan I wonder if this is related to the issues here?

Can you try with just ASPM off? If that helps it should be fixable by firmware I would expect.

I’ve been trying it with just the ASPM off flag but it didn’t work for me.

@DebugDan I tried both the simpler 3-flag approach as well as the full 8-flag approach and neither one has solved the issue for me.

In my case I am using a PCIe x4 to x16 adapter and then a riser cable, so I can't rule out that something there is also interfering, but essentially my situation is that the GPU keeps switching between PCIe 4.0, 1.0, 2.0, 3.0, back to 4.0, and then down to 1.0 again.
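For anyone on Linux wanting to catch that link-speed bouncing, the negotiated speed shows up in lspci's LnkSta line; the device address below is just an example, and the live commands are commented out since they need root and real hardware:

```shell
# Live check (example address; find yours with plain `lspci` first):
#   sudo lspci -vv -s c1:00.0 | grep LnkSta:
# Repeat it (e.g. under `watch -n1`) -- a Speed value that keeps changing
# means the link is retraining. Pulling the speed out of a sample line:
echo 'LnkSta: Speed 2.5GT/s (downgraded), Width x4' | grep -oE 'Speed [0-9.]+GT/s'
```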

Edit: Also found this - wondering if it may be related?

Maybe. I debated trying to use the PCIe slot with a riser to run my 7900 XTX GPU, but after days of checking all the specs and considering all of the options, I just didn't find a configuration that I was confident would work. So I abandoned that plan and got an eGPU over USB4 instead (there is another thread on this). That has worked pretty well for me so far. I decided to use the PCIe slot for a third SSD, which is also working well. Both took some work and a little trial and error, but it was worth it.

Graphics: AMD Radeon 8060S
AMD Radeon 8060S, 98304 MB LPDDR5 SDRAM
Graphics: AMD Radeon RX 7900 XTX (Navi31 XTX) [ASRock]
AMD Radeon RX 7900 XTX, 24576 MB GDDR6 SDRAM
Drive: CT4000P310SSD8, 3907.0 GB, NVMe
Drive: ORICO, 4000.8 GB, NVMe
Drive: CT4000P310SSD8, 3907.0 GB, NVMe
Drive: Western Digital SN560E, 1953.5 GB, NVMe
OS: Microsoft Windows 11 Professional (x64) Build 26200.7462 (25H2)

Do you know if it was also jumping between Gen 4.0 and Gen 1.0 (and in between)?

I filed a support request referencing this thread in the hope that maybe between them and the community we might find a solution.

Edit: Found this on GitHub that may be related too:

I can confirm that manually forcing the PCIe slot to Gen 3 in the BIOS was what worked for me. I was seeing similar instability where the link wouldn't negotiate properly or the system would hang. Even though my drive and adapter claimed Gen 4/5 compatibility, the Framework Desktop board seems very sensitive on that x4 slot. As soon as I hard-locked it to Gen 3, it worked as it should.
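On Linux you can verify the lock took effect by reading the current link speed from sysfs and mapping it to a generation. The helper below is just a sketch (the speed-to-generation table is from the PCIe spec: Gen1 = 2.5 GT/s, Gen2 = 5 GT/s, Gen3 = 8 GT/s, Gen4 = 16 GT/s), and the device address in the comment is an example:

```shell
# Map a kernel-reported link speed string to a PCIe generation (sketch).
pcie_gen() {
  case "$1" in
    "2.5 GT/s"*)  echo "Gen 1" ;;
    "5.0 GT/s"*)  echo "Gen 2" ;;
    "8.0 GT/s"*)  echo "Gen 3" ;;
    "16.0 GT/s"*) echo "Gen 4" ;;
    *)            echo "unknown" ;;
  esac
}
# Live check (example address):
#   pcie_gen "$(cat /sys/bus/pci/devices/0000:c1:00.0/current_link_speed)"
pcie_gen "8.0 GT/s PCIe"
```

If the BIOS lock stuck, that live check should keep printing "Gen 3" even under load.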