You can do image+text as well (although maybe the results are better if you do raw image to prompted image to video?)