It’s not the same. Presumably public domain works are much more frequently shared on the public internet and therefore much more common in the training set