Thanks for sharing! I was literally getting ready to build, essentially, this. Now it looks like I don't have to!
Have you ever considered using a foot-pedal for PTT?
Apple incidentally already has native STT, but for some reason they just don't use a decent model yet.
They do, and they even have that nice microphone F5 key for it, and an ideal OS level API making the input experience >perfect<.
Apparently they do have a better model, they just haven't exposed it in their own OS yet!
https://developer.apple.com/documentation/speech/bringing-ad...
Wonder what's the hold up...
For footpedal:
Yes, conceptually it’s just another evdev-trigger source, assuming the pedal exposes usable key/button events.
Otherwise we’d bridge it into the existing external control interface. Either way, hooks are there. :)
The only issue with Apple models is that they do not detect languages automatically, nor switch if you do between sentences.
Parakeet does both just fine.
sorry, PTT?
push-to-talk.
ty