> All SOTA model providers are losing money.

Source? I only read one article on this topic and they approximated gross margins at 50%.

> When users run Opus, they are essentially renting a GPU cluster worth half a million dollars for a $100/$200 subscription.

They use a large batch size, you're sharing the GPU with many other people.