Take a look at Ollama Cloud: https://ollama.com/pricing

You get access to a whole bunch of bleeding edge open models including GLM-5.2, Kimi K2.7, DeepSeek 4 Pro, etc. Inference is run on US/SG/EU cloud providers with zero data retention policies. The $20/mo tier is very generous, in my experience.

They don’t have a statement about where it is run or data retention on the GLM5.2 model. They do state that for others, like MiniMax.

There's a blanket statement at the bottom of the pricing page, which I would hope also applies to GLM-5.2:

> Where are models hosted?

> Ollama hosts models and compute resources primarily in the United States. To serve global demand, we may route to Europe and Singapore for additional capacity.

> Is my prompt or response data trained on?

> Prompt or response data is never logged or trained on.

> Who does Ollama partner with to host models?

> Ollama collaborates with NVIDIA Cloud Providers (NCPs) to host open models.

> When Ollama partners with providers, we require no logging, no training, and zero data retention policies in place.