Show me a coherent video that lasts more than 5 seconds and was generated with the model and maybe I'll start to care.