I self-host Flash actually, but yeah.

When I use their API, I do so knowing that they probably train on the data and use it to improve future iterations of their models.

But I use their API extremely rarely these days, because local Flash is good enough for me the vast majority of the time.

And you’ve opened Wireshark and verified the model is sending absolutely nothing? Not caching and sending later, etc.?

If you self-host, you can audit the open-source llama.cpp, or whichever other program you use for inference, to see exactly what it does. The same goes for whichever open-source harness you use to implement a coding assistant or other agentic workflow.

The model consists of a bunch of data files; it does absolutely nothing by itself.
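
To make the "just data files" point concrete, here is a minimal sketch, assuming the model is stored in the GGUF format that llama.cpp loads, and assuming the version 2+ header layout as I understand it: a 4-byte magic, a uint32 version, then uint64 tensor and metadata counts, all little-endian. The name `read_gguf_header` is my own and not part of any library.

```python
import struct

GGUF_HEADER_SIZE = 24  # 4 (magic) + 4 (version) + 8 (tensors) + 8 (metadata)

def read_gguf_header(data: bytes) -> dict:
    """Parse the fixed-size start of a GGUF file (assumed v2+ layout)."""
    if len(data) < GGUF_HEADER_SIZE:
        raise ValueError("need at least 24 bytes")
    # "<" means little-endian with no padding between fields.
    magic, version, tensor_count, kv_count = struct.unpack_from("<4sIQQ", data, 0)
    if magic != b"GGUF":
        raise ValueError("not a GGUF file")
    return {
        "version": version,
        "tensor_count": tensor_count,
        "metadata_kv_count": kv_count,
    }
```

You would feed it the first 24 bytes of a model file opened in binary mode. Everything after that header is metadata key/value pairs and tensor data; there is nothing executable in there, and it is the inference program that decides what to do with it.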

If you run inference on your own hardware, you have absolute control over how the LLM is used, unlike when you use an external service provider.
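
If you do want to check the inference process rather than take it on trust, a rough sketch of the idea: collect the remote addresses of its open connections (for example by hand from `ss` or `lsof` output, or from a packet capture) and flag anything that is not loopback. The helper `non_local_remotes` below is hypothetical, and it assumes you have already extracted plain IP strings.

```python
import ipaddress

def non_local_remotes(remote_addrs):
    """Return the remote IPs that are not loopback addresses."""
    flagged = []
    for addr in remote_addrs:
        ip = ipaddress.ip_address(addr)  # raises ValueError on malformed input
        if not ip.is_loopback:
            flagged.append(addr)
    return flagged
```

An empty result for a local server that only talks to a client on the same machine is what you would expect to see. It is not a substitute for reading the source or capturing traffic over time, just a quick sanity check.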

Not sure if you mean something else, but the model itself is not able to send anything.