Llama.cpp/vLLM Toolboxes for LLM inference on Strix Halo

@Eugr
Can you please let me know what’s the data format you used? is it MXFP4?