Good question. The one thing that Altman really seemed keen to play up was the whole integrate yourself into the video which from what I watched is definitely a step beyond the more conventional Image-To-Video models.

Depressingly that's probably a killer feature since if there's one thing people want to see more of it's themselves.

IMHO I also think the fact that they're trying to position themselves as a sort of infinite doom-scrolling tiktok lends support to the idea that their models are still only suitable for relatively short videos since coherency probably falls off a cliff after 30-60 seconds.