So they can also keep models trained on the datasets? That seems pretty big too, unless the half life of models is so low it doesn't matter.

It's a separate suit being wages against Meta and OpenAI etc.

There's piracy, then there's making available a model to the public which can regurgitate copyrighted works or emulate them. The latter is still unsettled