Just like using an AI model, you can’t actually know for sure that it won’t do anything malicious with what interfaces you give it access to. You just have to trust it.

Well, you can at least check if there is network traffic to AWS or something similar.

But wouldn't that look the same as actually querying the model? Or am I missing the joke?