Did you consider their peak hours and model usage multiplier? Read the green box https://docs.z.ai/devpack/overview#usage-instruction
I had the Lite plan and I NEVER maxed out the quota, because I took these things into account. If I had switched over to GLM-5-Turbo, for example, I could easily have burned through the quota.
I just read it and honestly it left an even worse taste in my mouth.
>GLM-5.2 and GLM-5-Turbo are advanced models designed to rival Claude Opus model. Its usage will be deducted at 3 × during peak hours and 2 × during off-peak hours.
Claude certainly does not punish me for using their best models. Why should this "up and coming" company do it?
I thought up-and-coming AI companies were supposed to have some kind of edge in price/performance (see DeepSeek's insanely cheap V4 Flash and Pro).
With a Claude Code plan, can you generate as many tokens with Opus as you can with Haiku before filling your 5-hour window? The same thing is going on here.
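
To put the quoted multipliers in numbers, here is a rough sketch. Only the 3× peak and 2× off-peak figures come from the docs quote above; the quota of 120 requests is a made-up number, and treating other models as 1× is my assumption, not something stated in this thread.

```python
# Multipliers taken from the quoted docs passage.
PEAK_MULTIPLIER = 3
OFF_PEAK_MULTIPLIER = 2
# Assumption: models without a stated multiplier are deducted at 1x.
BASE_MULTIPLIER = 1


def effective_requests(quota: int, multiplier: int) -> int:
    """Whole requests a quota covers when each one is deducted at `multiplier` x."""
    return quota // multiplier


quota = 120  # hypothetical quota, for illustration only

print("1x model:         ", effective_requests(quota, BASE_MULTIPLIER))
print("GLM-5.2 off-peak: ", effective_requests(quota, OFF_PEAK_MULTIPLIER))
print("GLM-5.2 peak:     ", effective_requests(quota, PEAK_MULTIPLIER))
```

So under these assumptions the same quota stretches to half as many requests off-peak and a third as many at peak, which is why switching models can burn through a plan quickly.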