You might also like shadow if you're only interested in local speech-to-text.