In case it helps: on the FW13 AMD or FW16 AMD, LLMs don’t use any VRAM. They use GTT instead, which needs no BIOS setting at all.
For example, I use a GTT of 60 GB with ROCm / LLMs, so I want the VRAM to be as small as possible, leaving more memory available for GTT.
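If you want to check how your own machine is split between VRAM and GTT, here is a minimal sketch. It assumes a Linux system with the amdgpu driver, which exposes `mem_info_vram_total` and `mem_info_gtt_total` (in bytes) under `/sys/class/drm/cardN/device/`. The helper name `read_amdgpu_mem` is just something I made up for illustration, and the card number may differ on your system, so the script simply looks at every card it finds.

```python
from pathlib import Path

# sysfs files exposed by the amdgpu kernel driver; values are in bytes.
FIELDS = ("mem_info_vram_total", "mem_info_gtt_total")


def read_amdgpu_mem(device_dir):
    """Return {field: size in GiB} for the amdgpu files present in device_dir.

    Hypothetical helper for illustration; fields that are missing are skipped.
    """
    sizes = {}
    for name in FIELDS:
        path = Path(device_dir) / name
        if path.is_file():
            sizes[name] = int(path.read_text().strip()) / 2**30
    return sizes


if __name__ == "__main__":
    # Assumption: the GPU shows up as /sys/class/drm/cardN; check all of them.
    for dev in sorted(Path("/sys/class/drm").glob("card[0-9]*/device")):
        sizes = read_amdgpu_mem(dev)
        if sizes:
            print(dev)
            for name, gib in sizes.items():
                print(f"  {name}: {gib:.1f} GiB")
```

On a setup like mine you would expect a small VRAM total and a large GTT total. If nothing is printed, the amdgpu files were not found at that path.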
E.g.