Yes, I've tried Parakeet v3 too. For its own purpose - running locally - it's amazing.
The thing that's particularly amazing about this Voxtral model is how incredibly rock solid the accuracy is.
For the longest time, previous models have been 'mostly correct' or, as people have commented elsewhere on this HN thread, have dropped sentences or lost or added utterances.
I have no affiliation with these folks, but I tried and struggled to get this model to break, even speaking as adversarially as I could.
I did say all the model. :)
That's a totally different class of model.
Do you mean https://huggingface.co/nvidia/nemotron-speech-streaming-en-0... ?
Yes, that's it.