Find this sort of innovation far less interesting or exciting than the text & speech work, but it seems to be a primary driver of adoption for the median person in a way that text capability simply is not.

Video generation is extremely exciting a.k.a. https://video-zero-shot.github.io/

However, personalization (teleporting yourself into a video scene) is boring to me. At its core, it doesn't generate new experience to me. My experience is not defined by photos / videos I took on a trip.

I also can't think of a reason why I would ever want to look at an AI generated video.

however as they hint at a little in the announcement, if video generation becomes good enough at simulating physics and environments realistically, that's very interesting for robotics.