I’m also trying to get the motherboard to work with dGPUs.
@Lincoln_Chen did you end up abandoning the use of the PCIe 4.0 x4 slot? I may have misunderstood, but it seemed like you were last using the NVMe M.2 slot(s) instead?
@Hrothmund how has that ADT-Link adapter worked for you?
In my experiments, I’m using the ADT-Link R23A-AMP (x4 to x16) and then a normal GPU riser, but the connection isn’t stable and keeps dropping to PCIe Gen 1.0. Forcing Linux to stick to Gen 1.0 solves the renegotiation issue, but I’m still seeing GPU usage (and power draw) spike for no reason.
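For anyone chasing similar downtraining, here’s a quick way to see what the link has actually negotiated versus what the slot/card is capable of (the `01:00.0` address is a placeholder — substitute your dGPU’s address from the first command):

```shell
# Find the dGPU's bus address (pattern matches common GPU class strings).
lspci | grep -iE 'vga|3d controller'

# Compare capability vs. current state for that device.
# LnkCap = what the link can do; LnkSta = what it negotiated right now
# (2.5GT/s = Gen1, 8GT/s = Gen3, 16GT/s = Gen4). If they differ, it downtrained.
sudo lspci -vv -s 01:00.0 | grep -E 'LnkCap:|LnkSta:'

# Same info straight from sysfs (note the 0000: domain prefix):
cat /sys/bus/pci/devices/0000:01:00.0/current_link_speed
cat /sys/bus/pci/devices/0000:01:00.0/current_link_width
```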
@Lincoln_Chen Thanks! To clarify, was this the one that ended up working OK in the end? I’ve ordered a different one but if it doesn’t work I may try this one out of desperation.
I have been very busy the last couple of months, and I haven’t had a chance to finalize the design. I had most of it done, but I had to allocate my time elsewhere for a while. I didn’t even get a chance to assemble my custom Framework motherboard build until my Christmas break. I have ordered all of the components, though, and they are sitting in the manufacturer’s warehouse ready for production. I will have a lot more time for the next little while, so I should be able to finish it and test it out.
I never got it working properly, so I set it aside for other projects for the time being. I have a lot to learn on the software side anyway - even if it did work, I’m not sure if I’d know how to configure my llama docker container to offload some layers to the dGPU.
At this point I’m waiting for a firmware update that addresses issues with dGPUs, or for more people to report that they got such a setup working, before I revisit it.
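On the software side, for what it’s worth: with llama.cpp the layer offload is mostly just the `-ngl` / `--n-gpu-layers` flag, and in Docker you additionally pass the GPU through to the container. A rough sketch, assuming the upstream CUDA server image — the image tag, model path, and layer count here are examples, not something I’ve verified on this board:

```shell
# Sketch: llama.cpp's CUDA server image with a partial layer offload to the dGPU.
# --n-gpu-layers controls how many transformer layers land on the GPU;
# a large value like 999 effectively means "all of them".
# Requires the NVIDIA Container Toolkit for --gpus to work.
docker run --rm --gpus all \
  -v /opt/llm-models:/models \
  -p 8080:8080 \
  ghcr.io/ggml-org/llama.cpp:server-cuda \
  -m /models/gemma-3-27b-it-q4_0.gguf \
  --n-gpu-layers 20 \
  --host 0.0.0.0 --port 8080
```

If the model doesn’t fully fit in VRAM, you’d tune the layer count down until it does; the rest stays on system RAM/CPU.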
After pre-ordering my FW Desktop, I read through this thread many times, and to be honest, I was worried about my homelab plan using a MAX 395 with an RTX 3070. I’m happy to report that I finally received my Framework Desktop mainboard, and with the help of a 10cm PCIe 4.0 x4 to x4 riser card (without supplemental power), my setup is working flawlessly. I’m running CachyOS and getting a PCIe 4.0 x4 link with resizable BAR enabled. I haven’t encountered any boot issues so far.
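In case it helps others confirm the same thing: one way to sanity-check resizable BAR from Linux is to look at the GPU’s memory regions in lspci — with ReBAR active, one of the BARs should roughly cover the card’s whole VRAM rather than the classic 256MB window. The `01:00.0` address below is a placeholder:

```shell
# With ReBAR working, expect one Region with a size near the card's VRAM
# (e.g. [size=8G] on an 8GB 3070) and a "Physical Resizable BAR" capability.
sudo lspci -vv -s 01:00.0 | grep -E 'Region|Resizable'
```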
I did some research into your question. Perhaps disaggregated prefill is an option. In vLLM there is an option to run two vLLM instances: one for the prefill phase (assigned to the dGPU; the Strix Halo is weaker at prompt processing than a dedicated graphics card or the DGX Spark), the other for the decode phase (assigned to the Strix Halo’s GPU, or even its CPU). The prefill phase is more compute-heavy and the decode phase is more memory-intensive. (see vllm/docs/features/disagg_prefill.md at main · vllm-project/vllm · GitHub ) The same technique is used by exo to combine various devices over the network. (see VPMTvC7faJE on YouTube; sorry, I can’t post more than two links)
The Framework Desktop only has a PCIe 4.0 x4 interface, so bandwidth is limited. This technique constantly sends the result of the prefill phase over to the decode phase, so I think the low PCIe bandwidth is less noticeable there. In vLLM it is possible to distribute the load over multiple GPUs, but due to the memory intensiveness of the decode phase and the PCIe bottleneck, that’s not really an option here. (correct me if I’m wrong)
A bonus is that vLLM normally does not allow using CUDA and ROCm at the same time. By splitting this into two vLLM instances, I’m hoping that becomes possible.
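For concreteness, this is roughly what the two-instance setup looks like in vLLM’s upstream disaggregated-prefill example (the model name is just an example, and the flags follow that example). One caveat: the `PyNcclConnector` used there is NCCL-based, and whether it can actually bridge a CUDA prefill instance and a ROCm decode instance is exactly the open question — treat this as a starting point, not a verified recipe:

```shell
# Prefill instance (dGPU, CUDA side) - acts as the KV-cache producer.
CUDA_VISIBLE_DEVICES=0 vllm serve meta-llama/Llama-3.1-8B-Instruct \
  --port 8100 \
  --kv-transfer-config \
  '{"kv_connector":"PyNcclConnector","kv_role":"kv_producer","kv_rank":0,"kv_parallel_size":2}'

# Decode instance (Strix Halo side) - consumes the transferred KV cache.
vllm serve meta-llama/Llama-3.1-8B-Instruct \
  --port 8200 \
  --kv-transfer-config \
  '{"kv_connector":"PyNcclConnector","kv_role":"kv_consumer","kv_rank":1,"kv_parallel_size":2}'
```

The upstream example then fronts both instances with a small proxy that routes each request through prefill first, then decode.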
Another option could be ik_llama.cpp ( GitHub - ikawrakow/ik_llama.cpp: llama.cpp fork with additional SOTA quants and improved performance ). But does it support two GPUs?
Update: I tried the 3090 again today, and it worked fine. Variables that have changed:
-I ordered a short cable from ADT-Link. The first one was 20cm, this one is 5cm. I had to bend the longer cable, and it’s pretty stiff, so that might have messed something up. I still had to bend the shorter one a bit - if I had to do it over again, I’d order a 3cm cable.
-When I removed the 3090 a few months ago, I ripped out the NVIDIA drivers and blacklisted nouveau. When I reinstalled the card today, Fedora seems to have auto-installed the latest NVIDIA drivers.
I’m still experimenting, but it seems to take bloody forever to load a model - several minutes. Not sure if that’s due to the x4 PCIe link or something else. Once it’s loaded, the performance is decent:
hrethric@delphi:~/llama.cpp$ ./build/bin/llama-bench -m /opt/llm-models/gemma-3-27b-it-q4_0.gguf -p 512 -n 128 -ngl 999
ggml_cuda_init: found 1 CUDA devices:
Device 0: NVIDIA GeForce RTX 3090, compute capability 8.6, VMM: yes
I realise this is unrelated, but someone in this thread was talking about making a custom heatsink. Looks like the guys at NFC might’ve made an adapter plate to allow fitting standard cooler mounts (see video here).