The main benefit I see for cloud platforms: caching/co-hosting various services based on model instead of (model + user's API layer on top).

For the end user, it would be one less deployment headache to worry about: not having to package ollama + the model into docker containers for deployment. Also a more standardized deployment for hardware accelerated models across platforms.