I think we're talking about STT (speech-to-text) here, not TTS.

whoops! absolutely correct, that's what I meant.