I have an opencode setup where I have GLM-4.7-Flash as more of an architecting agent and qwen3-coder-next as the implementing agent. But glm just does not perform well at all it is super slow compared to qwen which is way bigger and way more performant. Anyone have any suggestions for a good reasoning model to use instead? looking for something 20-30 Gb. Anyone have experience with a better model