Request: verify dGPU support

Djip · October 1, 2025, 7:52pm

I have some questions about that:

Which LLM server did you use that can do that?
The KV cache is split across all layers, so doing this requires many exchanges between RAM ↔ VRAM. With only a PCIe x4 link that is really slow (8 GB/s vs 256 GB/s).
Do you have any benchmarks of that configuration on another platform?
Using an NVIDIA dGPU means mixing CUDA and HIP at runtime. What do you use for that?
What speedup did you expect?
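
To put rough numbers on the bandwidth concern, here is a back-of-the-envelope sketch. It treats the two figures above as 8 GB/s (the PCIe x4 link) and 256 GB/s (local memory). The model shape (32 layers, 8 KV heads, head dimension 128, 8192 tokens, fp16) is a made-up example, not a measurement of any specific model or server, and the function names are hypothetical.

```python
def kv_cache_bytes(n_layers, n_kv_heads, head_dim, n_tokens, bytes_per_elem=2):
    """Approximate KV cache size: one key and one value vector per layer, head and token."""
    return 2 * n_layers * n_kv_heads * head_dim * n_tokens * bytes_per_elem


def transfer_seconds(n_bytes, bandwidth_gb_per_s):
    """Ideal time to move n_bytes at the given bandwidth (GB/s), ignoring latency and overhead."""
    return n_bytes / (bandwidth_gb_per_s * 1e9)


# Hypothetical model shape, chosen only for illustration.
size = kv_cache_bytes(n_layers=32, n_kv_heads=8, head_dim=128, n_tokens=8192)

pcie_s = transfer_seconds(size, 8)     # assumed PCIe x4 link bandwidth
local_s = transfer_seconds(size, 256)  # assumed local memory bandwidth

print(f"KV cache size: {size / 2**30:.2f} GiB")
print(f"Over PCIe x4:  {pcie_s * 1e3:.1f} ms per full pass")
print(f"Local memory:  {local_s * 1e3:.1f} ms per full pass")
print(f"Slowdown:      {pcie_s / local_s:.0f}x")
```

Whatever model shape you plug in, the ratio stays at 256 / 8 = 32x, so any part of the KV cache that has to cross the link on every token is bounded by that factor in the ideal case.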
