Llama.cpp/vLLM Toolboxes for LLM inference on Strix Halo
