Since you suggest it, I will check it out – but I’ve split my RAM 50/50 between the iGPU (VRAM?) and regular system RAM because I’m using RAM for the audio stuff (TTS, STT), so I’m not sure I can run gpt-oss-120B even if it is “sparse”!
In Linux, you can just allocate the minimum (512MB or 1GB) to iGPU, and it will function as unified memory.
OK you’ve lost me but intrigued me… which will function as unified memory? the 1GB allocated to iGPU or the 127 GB not allocated to iGPU?
but even more important (to me) is whether some particular LLM functionality requires dedicated VRAM (or whatever it’s going to be called), or whether everything is smart enough (in Linux) to “share and share alike” from the common unified RAM pool. I have already seen error messages to the effect of “insufficient VRAM” because another process had it locked or whatever…
if there really is a way to just let everything take what it needs at the moment without precluding other processes, that would be ideal (I guess) – but I’m not sure that is the case.
Not allocated to the GPU. I don’t know about all tools, but llama.cpp, vLLM, Vulkan, and PyTorch can all work with unified memory on Strix Halo just fine. Check out this page: AI Capabilities Overview – Strix Halo Wiki
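For what it’s worth, on Linux the shared pool the amdgpu driver lets the iGPU borrow from system RAM shows up as “GTT” memory in sysfs. Here’s a quick sketch to check how big it is – the sysfs path is the standard amdgpu location, but the `card0` index and the 64 GiB fallback value are assumptions for illustration:

```shell
#!/bin/sh
# Read the total GTT (shared/unified) pool the amdgpu driver reports.
# The card index may differ on your machine (card0, card1, ...).
GTT_FILE=/sys/class/drm/card0/device/mem_info_gtt_total

if [ -r "$GTT_FILE" ]; then
    gtt_bytes=$(cat "$GTT_FILE")
else
    # Fallback example value so the script runs anywhere: 64 GiB in bytes.
    gtt_bytes=$((64 * 1024 * 1024 * 1024))
fi

# Convert bytes to whole GiB for display.
gtt_gib=$((gtt_bytes / 1024 / 1024 / 1024))
echo "GTT (shared) pool: ${gtt_gib} GiB"
```

If the number is small, the `amdgpu.gttsize=` kernel parameter (in MiB) can raise it, so the iGPU can address most of system RAM without changing the BIOS carve-out.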
now it’s definitely worth looking into, because I would prefer Coqui TTS over Kokoro if nothing else in the stack started squabbling about VRAM – I think Coqui requires dedicated VRAM, whereas Kokoro uses system RAM.