anything is more accurate than the llms at generating images. chatgpt, google gemini, all of them... they're not optimized for image generation. it's why veo is an entirely different model from google for example. and even veo isn't the best video model either. people dedicated to images and video are just spending more time here (such as black forest labs). as a result, those specialized models are better.

What's better than veo?