Anthropic probably trained Mythos on their own code and found that it is too got at reproducing it.

I doubt that. Why would you train Mythos on its own code if you don't want it to be able to reproduce it? It's not going to add much to the overall corpus.

Synthetic training data has been the name of the game since years ago.