It's possible that they purchased the movies (although definitely without the proper licensing; buying a DVD allows for personal use, not training a commercial model), or maybe they simply pirated them.
It's also possible that models' entire understanding of the aesthetic comes from screenshots of the movie. Even if OpenAI didn't feed in each frame of the movie, they definitely fed in lots of images from the web, some of which were almost certainly movie screenshots.