Interesting, I assumed all model-routing was done utilizing an LLM. (I.e. non-deterministic.)

It’s possible that there’s a set of words or phrases that route deterministically to save money on obvious stuff.

I kind of wonder, though, which model they’re using to do the routing. It seems like a huge added cost to do these kinds of checks on every request

Wasn't it leaked in the Claude Code source that it was all regex?

[dead]