AMD AI Max+ 395 128GB with cline

There is a recent blog post from Cline about this that was pretty good: Cline + LM Studio: the local coding stack with Qwen3 Coder 30B - Cline Blog

They suggested using Qwen3 Coder 30B, and they have an option for a “compact prompt” to reduce context use. I’ve only started playing with it a little, so I don’t have a good impression yet of how useful it is; it’s definitely slower than cloud models, but not unusable.

Note that to get the model to load with its full context size, I had to increase the GTT limit (on Linux), as shown here: iGPU VRAM - How much can be assigned? - #7 by lhl
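For reference, here's a rough sketch of what raising the GTT limit looks like. The `ttm.pages_limit`/`ttm.page_pool_size` values are in 4 KiB pages, and `amdgpu.gttsize` is in MiB; the 96 GiB figure below is just an example, not what the linked post uses — pick a value that fits your own split of the 128 GB.

```shell
# Illustrative only: raise the GTT limit via kernel boot parameters.
# 25165824 pages x 4 KiB = 96 GiB; amdgpu.gttsize=98304 MiB = 96 GiB.
# Adjust these numbers for how much of your 128 GB you want the iGPU to use.
sudo sed -i 's/GRUB_CMDLINE_LINUX_DEFAULT="/&amdgpu.gttsize=98304 ttm.pages_limit=25165824 ttm.page_pool_size=25165824 /' /etc/default/grub
sudo update-grub   # or: grub2-mkconfig -o /boot/grub2/grub.cfg on Fedora-style distros
# Reboot for the change to take effect.
```

On newer kernels the `ttm.*` parameters are the ones that matter; `amdgpu.gttsize` is deprecated but harmless to set alongside them.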