I'm looking forward to the trial where Anthropic will have to disclose the sources of its training data, and then explain why it is entitled to charge customers for using regurgitated training data while Alibaba, which trains its models on Anthropic's models, is not.
Should be fun.
Edit: clarification
They already did, and paid $1.5B: https://authorsguild.org/advocacy/artificial-intelligence/wh...
That's only a fraction of the training data.
Quite amusing that unlimited access to the LibGen library is worth $1.5 billion.
It's about the same valuation as Bun, lol.
$3,000 per title.
Do you think many authors would give you rights to create derivative works en masse for that money?
For endlessly reselling the whole work verbatim? Well, where can I buy such a license in the real world, because then I would like to buy a couple of those!
Meta/Facebook got away with it though, right?
That's a great cost-benefit ratio. Can you and I steal and do illegal things and pay the same cost?
Sure, but only if you get the same benefits
Looks like we can't today. Man, it would be great to figure out how to be above the law, the way rich people in other social classes are.
Being logically consistent isn’t as profitable as being aggressive and loud.
And if the training data includes at least one GPL-licensed source, they should release the weights under the GPL.
While I love the sentiment, I feel like the odds of this actually ever reaching a trial are low, given the international positioning of the parties, and the... um... complex relationships involved.
Anthropic's actions seem performative. Others have already speculated on the likely audience(s).
> While I love the sentiment, I feel like the odds of this actually ever reaching a trial are low ...
As cited in a peer comment here[0]:
Of note in the judge's finding: "the piracy was not fair use".
0 - https://news.ycombinator.com/item?id=48667411
1 - https://authorsguild.org/advocacy/artificial-intelligence/wh...