Nice, I've also built something like this we use internally. Will it reduce token consumption as well?

Thanks. Re tokens reduction: not that I’m aware of. Would you mind explaining how it might? That could be a cool feature to add