sending the whole conversation to a cheap model could still be cheaper than sending just the latest message to the expensive one
you could even factor that cost comparison in automatically to help decide which model to use
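
rough sketch of what that automatic check might look like, in python. everything here is made up for illustration: the model names, the per-token prices, and the `ModelPrice` / `cheaper_option` helpers are hypothetical, and it only compares input cost (output tokens and answer quality are ignored), so treat it as a starting point rather than a real router.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ModelPrice:
    """Hypothetical price entry; the numbers used below are placeholders."""
    name: str
    usd_per_million_input_tokens: float


def input_cost(price: ModelPrice, tokens: int) -> float:
    """Input-side cost in USD of sending `tokens` tokens to this model."""
    return price.usd_per_million_input_tokens * tokens / 1_000_000


def cheaper_option(cheap: ModelPrice, expensive: ModelPrice,
                   conversation_tokens: int, latest_message_tokens: int) -> str:
    """Compare 'whole conversation to the cheap model' against
    'latest message only to the expensive model' and return the
    name of whichever model costs less on input."""
    cheap_cost = input_cost(cheap, conversation_tokens)
    expensive_cost = input_cost(expensive, latest_message_tokens)
    return cheap.name if cheap_cost <= expensive_cost else expensive.name


# Assumed prices, purely illustrative.
cheap = ModelPrice("cheap-model", 0.25)
expensive = ModelPrice("expensive-model", 15.0)

# 20,000-token conversation vs. a 500-token latest message.
choice_short = cheaper_option(cheap, expensive, 20_000, 500)

# Once the conversation grows to 40,000 tokens the comparison flips.
choice_long = cheaper_option(cheap, expensive, 40_000, 500)
```

with these assumed prices the 20,000-token conversation costs less on the cheap model than the 500-token message on the expensive one, and the balance tips the other way once the conversation is long enough. in practice you'd probably weigh this against how well each model handles the request, not cost alone.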