Water level in a glass changing between shots is one thing, the protagonist’s face and clothes changing is another.

Location consistency is important. Even something as simple and subtle as breaking the 180-rule [1] feels super uncanny to most audiences. Let alone changing the set the actor occupies, their wardrobe, props, etc.

There are lots of tools being built to address this, but they're still immature.

https://x.com/get_artcraft/status/1972723816087392450 (This is something we built and are open sourcing - still has a ways to go.)

ComfyUI has a lot of tools for this, they're just hard to use for most people.

[1] https://en.wikipedia.org/wiki/180-degree_rule

Well put. Honestly the actor part is mostly solved by now, the tricky part is depicting any kind of believable, persistent space across different shots. Based off of amateur outputs from places like https://www.reddit.com/r/aivideo/, at least!

This release is clearly capable of generating mind-blowingly realistic short clips, but I don't see any evidence that longer, multi-shot videos can be automated yet. With a professional's time and existing editing techniques, however...

[deleted]