LLMs scrape Wikipedia all the time, or at least attempt to.

The data bundle doesn't help with that at all.