This is at least murky, since a lot of pirated material is “publicly available”. Certainly some has ended up in the training data.
This is at least murky, since a lot of pirated material is “publicly available”. Certainly some has ended up in the training data.
It isn't? You have to break the law to get it. It's publicly available like your TV is if I were to break into your house and avoid getting shot.
That isn't even remotely a sensible analogy. Equating copyright violation with stealing physical property is an extremely failed metaphor.
One of the craziest experiences in this "post AI" world is to see how quickly a lot of people in the "information wants to be free" or "hell yes I would download a car" crowds pivoted to "stop downloading my car, just because its on a public and openly available website doesn't make it free"
Maybe you have some legalistic point that escapes comprehension, but I certainly consider my house to be much private and the internet public.