Qwen and GLM both promise the stars in the sky every single release, and the results are always firmly in the "whatever" range.

Qwen famously benchmaxxes. GLM is more robust; I'd say it's comparable to DeepSeek in that regard.