> All your data is stored locally on your device, and your audio goes directly from your machine to your chosen cloud provider (Groq, OpenAI, ElevenLabs, etc.) or local provider (Speaches, owhisper, etc.)

Their point is that they aren't a middleman here: you can use your preferred provider or run something locally.

The issue is

> All your data is stored locally on your device,

is fundamentally incompatible with the second half of that same sentence.

I'd write it as

> All your data is stored locally on your device, unless you explicitly decide to use a cloud provider for dictation.

Great correction, wish I could edit the post! Updated the README to reflect this.