Hacker News

Both AWS and Mistral prices above are per minute of input audio.

If Voxtral can process rapid speech as well as it claims to, an obvious cost optimization would be to speed up normal laconic speech to the maximum speed the model can handle accurately.