Re: the clips above

Although we can tell they are inaccurate, what percentage of people can visualize the prompts better in their mind’s eyes? I bet a substantial number can’t even tell the clips are generated if posted without context.

In a few aspects, these world models are already pretty close to what we have in our brains.

So what's the point? We built a machine which is only capable of letting people stop having to imagine things?