Funny how people are suddenly on Elsevier's side. It's clear to me that AI training is transformative fair use under existing law. Maybe this will be the case to prove it.
Funny how people are suddenly on Elsevier's side. It's clear to me that AI training is transformative fair use under existing law. Maybe this will be the case to prove it.
I find it grating that so many AI boosters try to frame pushing back against the AI industry as a sudden about-face for everyone that spent the last 20 years pushing back against the copyright industry. I’m also in favor of decriminalizing or legalizing small amounts of pot for personal use. That doesn’t mean I’m behind industrialized narcotic production on such a huge scale that it that it starts to distort the economy, and companies looking for new ways to add methamphetamine to every goddamn product.
>I find it grating that so many AI boosters try to frame pushing back against the AI industry as a sudden about-face for everyone that spent the last 20 years pushing back against the copyright industry.
What do you think the outcome of tightening fair use is going to be? Do you think its going to be most effectual against these big evil AI companies we are meant to fear? Or is it going to end up putting more individual creators on the end of Disneys pitchforks?
Like if you support creating a gun to kill a monster, that's great. But you need to understand that weapons rarely only target the person you want them to. And its unlikely that any bill that specifically targets a certain size or profit margin is going to make it all the way into law without being generalised to the approval of large IP holders.
Its much much (much) better to look at this as an opportunity to erode IP laws for everyone, than to make them worse and hope that your particular enemies are the only ones that are affected.
>That doesn’t mean I’m behind industrialized narcotic production on such a huge scale that it that it starts to distort the economy, and companies looking for new ways to add methamphetamine to every goddamn product.
Thats such a non sequitur. This isnt a weed legalisation argument, its "Do we make IP worse for everyone, because you dont like some people benefiting from fair use".
One could imagine a different legal standards for recreational, research, and commercial uses.
> One could imagine a different legal standards for recreational, research, and commercial uses.
Meta used allegedly stolen copyrighted materials to train a model they shared for free with the whole world. Is this a recreational use?
No it is not recreational use. And no, they are not freely sharing it. It is use to build a monopoly, make hones competition impossible and plan charge as much possible on it.
It is the same playbook everytime. We dont have to be naive and pretend meta is doing something for other peoples benefit.
>And no, they are not freely sharing it
Are you unable to access this page?
https://www.llama.com/llama-downloads/
Or this one?
https://lmstudio.ai/models/meta/llama-3.3-70b
>It is use to build a monopoly
How?
>We dont have to be naive and pretend meta is doing something for other peoples benefit.
Meta benefits from the current war of open model competition, but we also benefit from it. In particular, participating in all this makes it hard for them to pull the ladder up when the market changes. They will have to justify why whatever new hotness is better than these existing models already on our hard drives.
Tell me more about these methamphetamine products. Inquiring minds would like to know!
It would be disingenuous framing because the argument against copyright stems from a belief that information should be free. Meta does not do things in this spirit. There's no about face needed...
> It would be disingenuous framing because the argument against copyright stems from a belief that information should be free. Meta does not do things in this spirit.
Don't they? They release the llama model weights, they do things like this:
https://www.opencompute.org/wiki/Open_Rack/SpecsAndDesigns
They also make significant contributions to Linux and are the originators of popular open source projects like zstd and React.
They make their money from selling ads, not selling licenses.
They only released the weights because someone leaked them.
Someone leaked the llama 1 weights before they were released. That doesn't explain why they would release the subsequent versions except that they wanted to.
Speaking of ai and meth, have you seen videos of the palantir CEO Alex karp? Dude looks like he's regularly getting the same meth shots Hitler used to get.
But I hear you. One of my biggest tells that someone can't be reasoned with is when they resort to whataboutism without any consideration for how 2 situations can actually be different even if there is some commonality. It's a powerful bad faith argument technique. When that style of argument comes up I nod my head and walk away. Some people are just doomed.
[flagged]
I am not s copyright maximalist, but I would tell you be careful of a world where copyright and IP is meaningless. Might as well let any other country/company one shot your entire industry.
Slippery slope, false dilemma, etc. What other fallacies do you have in your utility belt, batman?
How did you know I was Bruce wayne?
Where's my goddamn electric car Bruce?
BYD sells them for super cheap.
I also find it funny, I said this regarding the other thread and article[0]
'"They then copied those stolen fruits"
How are these fruits "stolen" if they still have what was allegedley stolen?
Dowling v. United States, 473 U.S. 207 (1985): The Supreme Court ruled that the unauthorized sale of phonorecords of copyrighted musical compositions does not constitute "stolen, converted or taken by fraud" goods under the National Stolen Property Act
And even if, arguendo, sure its stolen. The purpose of copyright is to "To promote the Progress of Science and useful Arts, by securing for limited Times to Authors and Inventors the exclusive Right to their respective Writings and Discoveries"
And you would be hard pressed to prove that LLM's haven't advanced the arts and sciences, so at bare minimum transformative, ie fair use.'
[0] https://news.ycombinator.com/item?id=48026207#48029072
>How are these fruits "stolen" if they still have what was allegedley stolen?
If you write a book and I take it and embed its knowledge into my product that is so pervasive that no one needs to buy your book any more (and I don't even credit you so no one knows where that knowledge came from), to you really still have what was stolen? And I didn't even buy a copy of your book to copy it.
> If you write a book and I take it and embed its knowledge into my product that is so pervasive that no one needs to buy your book any more (and I don't even credit you so no one knows where that knowledge came from), to you really still have what was stolen?
The trouble with this analogy is that it proves too much.
Suppose you write a book, and so does someone else, but they have better marketing than you and then people in the market for that genre buy theirs instead of yours. Let's even stipulate that the existence of their book actually lowers your sales, because people who want that kind of book already bought theirs by the time they find out about yours and then some people don't have time to read or can't afford to buy both.
Notice that we haven't yet said a word about the contents of either book. They could be completely independent and they've never even heard of you or your book -- they "didn't even buy a copy of your book to copy it". All we know is that they're the same genre and the existence of theirs is costing you sales. By that logic all competition would thereby be "stealing", and that can't be right.
Which implies that you don't have a property right to the customers.
I like your argument, not because it is a good analogy for AI but because it is a good contrast. Copyright isn't a guarantee or magic force field blocking fair competition. It is a permeable buffer against lazy knockoffs and time-boxed so that buffer doesn't choke all future creativity.
People on this thread need to focus on what "derivative" and "fair use" mean and understand both are measured on a somewhat fuzzy spectrum, subject to interpretation.
In a perfectly fair world AIs/MLs could vacuum up all human knowledge, fair and square. (In an ideal world, they would do that adhering to polite opt-in/opt-out agreements with copyright holders. We can dream). Input isn't theft.
On output, two magic genies would stand at the gate, the Derivative Genie and Fair Use Genie and review anything spat out by the AI/ML. If it crossed agreed upon thresholds the Genies would bar the gates and issue a stern warning to prompt again (or maybe the AL/ML would auto-adjust the prompt and try again).
So, if your prompt asked for a 300-word poem about thrash metal mosh pit dancing and it spat out a poem where 85% of it match one of the handful of available mosh pit poems in its database, the Derivative Demon would block the output and raise an alarm.
On the other hand, if you asked for a line by line analysis of a famous mosh pit dancing poem (by name) or maybe asked for a satirical spoof of said poem, the Fair Use Demon would overrule the Derivative Demon and give the output a pass.
That's as fair as this could get, especially if you add one more thing: An Appeals Court (maybe corporate, maybe 3rd party, maybe state run) with a Settlement Pool. If a copyright holder could prove the Genies let pass something they shouldn't, the AL/ML would fix that. If real damage is done, the creator would get a settlement from the pool.
The point is that the Input Genie is out of the bottle. Creators just look foolish trying to squeeze it back in. Better, they should focus on making the output Genies and the Appeals process as effective and fair as possible for everyone.
A better analogy would be that you do original research or work and produce a valuable book. Somebody else looks at your work, decides it has value, and reproduces it in a new book under their name. The new book is cheaper, or easier to find, or for whatever reason displaces your original book created through your own research and investment. Now somebody else is profiting off your creativity or work, without payment or even acknowledgement.
I'm not sure how this plays out legally, but it certainly seems unethical
So for example, when Disney sees value in public domain stories like Cinderella, Rapunzel/Tangled or Snow White, and they make movies out of them, profiting from the creativity and work of the Brothers Grimm without paying anything to their estate, or high school plays do Shakespeare, that seems unethical to you?
Would it be fair for Greece to do retroactive term extensions all the way back to Plato and then sue anyone who copies the idea of having a university or uses the Platonic solids or distributes religious texts that incorporate the dualistic theory of the soul?
Your examples, as you say, are all public domain. Are all the works we train LLMs on public domain too? Was the original book in my analogy in the public domain? What do you think about training on material that isn't yet in the public domain?
You're framing this as an ethical question, but copyright term lengths are essentially arbitrary. They're set by the government, as are the boundaries of fair use. At which point you're making a circular argument. That it's bad if it's illegal and that it should be illegal because it's bad. So what happens if someone argues the opposite? That it's not unethical if it's fair use and then it should be fair use because it's not unethical.
Why are you talking about this case that case nothing to do with the topic at hand? The comment you’re replying to gives a very clear and narrow analogy, and you’re talking about something else.
How is it something else? It's the same analogy. The problem with it is that the harm from the alleged theft doesn't require any use of the original material in order to happen, since that "harm" is competition rather than expropriation.
The attempt to distinguish them is through copying, but that's the part that isn't depriving anyone of anything.
The main point here is _using_ copyrighted materials to create a commercial product, that you then sell, that may be used as alternative or substitute for the original materials. You’re missing that point and talking about two independent projects competing.
Because the competition is the only source of alleged harm, but people can do that even if they don't copy anything. There isn't actually a property right to the customers. You can lose sales to someone else whether they copied anything or not.
So what that you can loose sales even without crimes being committed? This somehow makes it okay to profit off someone’s work and ignore licenses?
What if I read your book (and a bunch of other books), and use what I learned to write my own book? Have I "stolen" your book?
Facts are not copyrightable. Only your particular way of expressing those facts is copyrightable.
Yes. That's not to say that something damaging wasn't done, but nothing was stolen. Stealing/theft requires deprivation of property. It's like receiving a normal nonlethal punch in the face and calling it murder. Murder requires someone dying.
> Theft [...] is the act of taking another person's property or services without that person's permission or consent with the intent to deprive the rightful owner of it. --- https://en.wikipedia.org/wiki/Stealing
My God, I can't believe chodes are still playing this "how many angels can you fit on the head of a pin" navel gazing semantic argument. Thirty years at least, it was all you saw on fin de ciecle Slashdot from anyone with a six-digit UID. No one cares about your hyper literalist meaning of "theft," that's not the goddamn point. Christ, this place looks like Reddit more and more.
This isn't a court of law. We don't have to talk like lawyers. If you replaced "theft" with "copyright infringement" in the comment you had such a problem with, what meaningfully changes besides we all have about five additional brain cells?
Even the case for copyright infringement is weak. LLMs are not copying machines, we already have copying machines at much lower price, almost zero, and perfect fidelity and much faster than generating it probabilistically. So it makes no economic sense to spend billions on training and inference to make a copier. In fact the value of LLMs is where they do not copy but apply knowledge a new situation.
> If you replaced "theft" with "copyright infringement" in the comment you had such a problem with, what meaningfully changes besides we all have about five additional brain cells?
The obvious difference that copyright is subject to fair use and various other limitations that personal property isn't.
Ever hear of Aaron swartz?
Aaron Swartz was charged under the CFAA, which isn't even copyright law, and the prosecution was widely condemned as draconian overreach.
>> Stealing/theft requires deprivation of property
maybe you should look up the definition of property, which is a set of legally recognized rights over a thing, typically including:
* possession (what you're focusing on)
* use
* exclusion
* transfer
The last 3 seem like they have been breached, in legally that's theft.
Violation of these rights may be criminal without meeting the strict legal definition of theft.
This can even extend to stealing physical property.
Depending on local laws, stealing a car may not actually be theft if the defendent can prove they intended to return it before the owner got home from work, though it would certainly be considered theft in the colloquial sense of the term, and they would still be guilty of a lesser offense like civil and/or criminal conversion.
> Depending on local laws, stealing a car may not actually be theft if the defendent can prove they intended to return it before the owner got home from work
I doubt there's even one place where the law works like that.
> I doubt there's even one place where the law works like that.
In a lot of places, that's how it works. A key element of theft is the intent to permanently deprive someone of property.
This is why joyriding isn't classified as auto theft and is instead a lesser offense. It's because joyriding is an intent to temporarily deprive, while GTA is an intent to permanently deprive.
In some jxns (the UK is one), there is a tort called trespass to goods, and an example of this would be "stealing" someone's property to deliver to another location for them to use there. The tort of conversion is similar: interference with someone's property right to treat it as your own (silent as to length of time).
Yea in the us if someone tries to steal your car and you are in it or threatened by it you can shoot them dead or something like that (ianal) You may have a court day but in many situations no punishment will follow.
Theft is not the breach of any property right. It's specifically the deprivation of property without consent. Yes, I have checked the definition in my jurisdiction.
Getting punched in the face also violates rights, yet isn't murder. Murder is specifically about dying.
You forget that laws are made by people and at anytime they can change interpretations are arbitrary, roe vs wade today but not tomorrow.
People seem to think what ai is today is theft. If enough people agree, it will be theft. Big companies dont like this and push the other way. An objectiveness doesnt exist here. It is too wiggly
You’re splitting hairs over a definition that isn’t relevant here (theft and copyright infringement are different things) to defend something that even you agree is bad.
It isn't splitting hairs. The damages are completely different in nature.
With theft, the entire damage is the deprivation. It could be an heirloom or some other object that may have been entrusted to you, something that can never be replaced, memorabilia of loved ones. Something that you may have needed in your posession to survive (e.g. a car to go to your job).
With a given copyright violation, the damage is that maybe[1] you made less profit than you could have. The potential for profit is not property. Profit isn't guaranteed.
[1] The loss is not certain, because there's no guarantee that the ones consuming the copyrighted content could have even afforded it.
Cool cool cool. So all the code and data you send to anthropic and chatgpt should be mass distributable to forward other peoples arts and science? All your meeting notes with ai summarizers, slack chats with bots? Might as well put your entire company and all plans for it on github mit licensed. Ill take a peek, see if there's anything valuable to me in that. Don't worry you can keep it all on your github too. It's still yours afterall. Copilot will be training on it too though btw
That's a privacy violation, not relevant.
No it's not. You exposed that data to an LLM. Should have read the fine print. The laws around that don't make sense to me anymore so therefore I own that stuff now. That's how this works right? You do know chatgpt etc can read everything you write, right?
Also social media profile pics. Great way to get faces for deep fake ads. Most people are just 1 phone call away from being voice cloned. Our likeness isn't all that important either if you think about it.
Maybe meta will clone your writing style and sign into your meta account and message your friends telling them about this awesome new product. Meta owns the account and you uploaded data to it.
Literally none of these things are defensible positions, so nobody will take you seriously.
Many of the things I wrote are already happening. The others probably are but haven't been reported yet.
I think Anthorpic has pledged to not use team and enterprise user's data for training purposes. I don't mind if they do some verification or whatever as long as it doesn't end up in the responses it gives others.
I have an amazing timeshare for sale and you seem like someone who would really see the opportunity this provides. How are your financials?
What Silicon Valley company over a decade old has respected the limitations on using data that they agreed to? At least any valuable data.
yes yes and google pledged "don't be evil"
Don't be naïve. A corporation would tear the flesh from your body if it meant a better quarterly earnings report.
Having seen someone die at work, this is factual. The comments made during and after were eye opening.
You were swiftly corrected about your misunderstanding under your original comment. Reposting it here, removing the quote farther from its context, and hoping to not be downvoted again is very weird!
I don't see how me quoting the actual complaint the news was about, in both threads, was me being swiftly corrected. If you where to base it on upvotes then this one shows I'm right and you got swiftly corrected here. In both cases it was relevant as both threads where not yet merged and about the same complaint. And held two positons on front page and I was adding to the discourse.
>It's clear to me that AI training is transformative fair use under existing law.
I wouldn't even go that far. Its an entirely new product. Its like the guy who sold you the keyboard demanding royalties for the software you built.
That the person who wrote the book couldn't predict a new use case for the book in training LLMs, is irrelevant. The book isn't in the LLM. Its not being sold with the LLM. Its one of billions of tools used to create the LLM.
People try and sell this as the AI companies extracting value from the poor little IP holders like Disney. Its maddening. That content is your cultural heritage. It already belongs to you, just some idiot has been granted a lifetime of exclusive exploitation. An LLM is trained on data you already own. Disney et al wants to exploit the new technology to extract even more money out of stuff created often decades ago.
At absolute worst its reverse engineering, which was supposed to be fair use protected in the US but apparently that's been somewhat eroded.
> The book isn't in the LLM.
An LLM is essentially a lossy compression of the training data. The book absolutely is in there, it’s just mangled to the point of unrecognizability.
The wood tends to have an impression of the hammer that hits it. The book isn't in there, the weights are just shaped by what tools were used to form it.
When large quantities of source material are replicable by prompting its a bug not a feature.
That's just semantics. The wood would be there without the hammer, the LLM wouldn't be here without the copyrighted works it's based on.
No, thats just semantics.
>The LLM Wouldnt be here without the copyrighted works
Google wouldn't be here if it hadn't scraped every copyrighted website and used them to form a searchable graph of the internet but we only hear complaints about them when they reproduce entire news articles.
If my book isn’t in your LLM, then prove it and don’t use my book to train your LLM.
>don’t use my book to train your LLM.
What makes you think you are entitled to tell people what they can and cant do with data they purchased (or otherwise acquired) from you. Extremely honest question. I just cant put myself in your shoes.
Like if I had written anything useful I would be overwhelmingly flattered that my content be considered so worthy for inclusion.
Your profile suggests that you are a philosopher. Did you get into philosophy hoping to exploit the publishing industry to the extent that you can squeeze every cent out of your thoughts, and deny their potential uses downstream?
Its actually crazy how bad things are, I am usually keen on capitalism and exclusivity, but the whole thing with LLMs, I see people pushing hard to tighten the grip of intellectual property. I see people making 50 cents a month on Kindle Unlimited suddenly shocked that someones LLM generated output might be ever so slightly influenced by weights ever so slightly influenced by their work, seemingly thinking they might get some big payday out of it.
Give me a tiny little wedge of understanding of your thought process. Your book is right now, doing a greater social good on your behalf than me running around and removing all the trash from my neighborhood, and the benefits of that social good are going to accrue long after you and I are gone. Your work is now going to live on, in a very tiny way, in these systems forever. I am honestly envious.
If anything, I would be trying to get bad writing removed from LLM training data. Things that I dont want to influence others. But as a potentially honest promoter of your work, you want it removed?
Whats the number? If not 1:1 exactly what you charge for the book, what do you think the proper compensation for slightly influencing training weights you should receive?
> What makes you think you are entitled to tell people what they can and cant do with data they purchased
Hundreds of years of copyright law. I bought a copy of Windows, but I’m not allowed to modify that data with a cracker and sell a bootleg DVD of it.
I should edit to clarify that I’m not a big fan of Lars Ulrich or Disney, but I don’t think we’re going to get a win here for the recreational IP pirates. What’s more likely is that we’ll end up with some Frankenstein law that favors both Mikey Mouse and OpenAI, and you and I will neither get free movies nor the ability to earn a living off of our creative labor.
I mean, the comparable situation would be, being allowed to sell something you created on Windows.
But in abstract you should absolutely be able to modify and sell windows.
To continue your analogy, I had to pay for Windows before I was allowed to create something with it, or acquire a license for under terms they set forth. If AI companies stopped at the public domain, then my argument wouldn't really hold up, but they didn't do that. They acquired everyone's copyrighted works without regard for the license and now they're, in the most charitable interpretation, using them to create derivative works.
And before you give me an analogy about how someone could listen to Pink Floyd and then produce works inspired by their influence yada yada: Someone is a human being with human rights, and if we're going to start pretending that training an LLM is in any way analogous to human consumption and creativity, and not an industrial process that encodes input data into a digital artifact, then let's start by saying LLMs have human rights and cannot be owned by a company that charges for access to them.
>To continue your analogy, I had to pay for Windows before I was allowed to create something with it, or acquire a license for under terms they set forth.
Yep and so far it looks like the issue with the meta case is they didnt pay for the book. Not that they used it in training data.
>in the most charitable interpretation, using them to create derivative works.
Yeah in the same way I use a hammer to create a derivative table.
>Someone is a human being with human rights, and if we're going to start pretending that training an LLM is in any way analogous to human consumption and creativity.
I dont care about that. Its simply a tool being built using existing tools. Like using a jigsaw to make a step ladder.
> Yep and so far it looks like the issue with the meta case is they didnt pay for the book. Not that they used it in training data.
Let's not sane-wash what they did here, they didn't just 'forgot to pay for the books', they deliberately and illegally downloaded and used material that wasn't theirs to use.
If you or I did that, we would be jailed or sued into destitution. In a fair world we either should change copyright laws (allowing for anyone to freely pirate all media), or Zuckerberg needs to go to jail.
>Let's not sane-wash what they did here, they didn't just 'forgot to pay for the books', they deliberately and illegally downloaded and used material that wasn't theirs to use.
Yes. Forgot is your word.
But lets face it, there wouldn't be a case to answer for if they had paid retail for each book, torn them up and scanned them and trained on that data.
>Zuckerberg needs to go to jail.
I am comfortable with that but would prefer updating copyright.
A million dollars please.
It’s called a copyright notice. Same as a license. If you’re running a commercial business you can’t legally just take that piece of work and reuse it. Pick any book off your shelf and pretty well every one of them will have words to the effect of:
All rights reserved. No part of this publication may be reproduced, distributed, or transmitted in any form or by any means, including photocopying, recording, or other electronic or mechanical methods, without the prior written permission of the publisher, except in the case of brief quotations embodied in critical reviews and certain other noncommercial uses permitted by copyright law. For permission requests, write to the publisher, addressed "Attention: Permissions Coordinator," at the address below.
Same as every piece of commercial software has a license which has to be abided by. Same as use of Meta’s service has terms and conditions which HAVE to be agreed to.
So yeah they’re free to break that license but they’re also free to be sued by IP holders for breaking it at scale.
Well its not a solved issue in terms of law. But even still, I would have expected you to understand that I wasnt speaking legally.
Illegally obtaining copyrighted materials is usually the issue not the transformation part
Looking at the complaint ( https://publishers.org/wp-content/uploads/2026/05/2026-05-05... ), that seems like the part that's got the most solid foundation, especially given that while torrenting the books, they were also seeding to other peers.
The items they call out around training the models (and attempting to claim that each subsequent model generation should count as an additional instance of infringement) seem far less grounded in the current court interpretations of AI training.
Absorb all "our" IP without consent, in doing so remove "our" own source of revenue, and then repackage it as their own product. Not really fair use IMO.
How does that work? Is it a kind of infringement without substantial similarity?
I find it hard to think of a reasonable analogy. But it's like coming into your house, stealing all your belongings, and then building a new house with all your shit inside and then selling it back to you.
I think this completely misses the point... the point is that Meta pirated the media they used to train their model.
I am not a fan of US copyright law, but if I torrented millions of books, I would be facing a felony charge in criminal court and a (with statutory damages as high as $150,000 per title in cases of willful infringement) multi-billion dollar lawsuit in civil court.
In my opinion, this has nothing to do with whether or not AI training is transformative and this fair use, and everything to do with whether or not the laws apply to everyone equally. If Facebook isn't forced to pay billions and elect a sacrificial executive to serve prison time, then I will remain angry.
> It's clear to me that AI training is transformative fair use under existing law. Maybe this will be the case to prove it.
That is not what this case is about. It is more about the illegal violation and piracy of copyrighted content done by Meta for commercial use and Zuck knew they were doing it.
Why did Anthropic settle [0] with a multi-billion dollar payout to authors after commercializing their LLMs that was trained off of copyrighted content that was illegally obtained and kept without the authors permission?
There's a reason why they (Anthropic) did not want it to go to trial. (Anthropic knew they would lose and it would completely bankrupt them in the hundreds of billions.)
AI boosters will do anything to justify the mass piracy and illegal obtainment of copyrighted material for commercial use (not research) which that is not fair use in the US. There is no debate on this. [0]
[0] https://images.assettype.com/theleaflet/2025-09-27/mnuaifvw/...
I think copyright is far for being the most important aspect related to AI, it's geopolitical and economical. And even if it was the most important, there is only a case to be made for 1. that copy used to train models and 2. rare or induced regurgitation by targeted prompting.
The original work is not replicated identically, why would we replicate a work when it can be more easily seen in original or replaced with an alternative options online. We use AI to produce new outputs to new situations. We already have had drives and networking for plain copying.
If i could ask for a summary from an llm vs buy a book id go with the summary. That eats into commercial use and the supreme court case sided with Gerald Ford when a newspaper published a small gist of his autobiography because it ate into the sales
Every single Wikipedia article of a book or TV show has this summary. Ford should have lost.
Probably, Educational purposes is strong component of fair use doctrine
Yea nope. I like the full book without any loss of information. Even if I don't want to read the entire book. LLMs love to respond even when something is outside of their training set.
I think both Elsevier and the people that appropriate IP for training commercially deployed AIs purpose without the consent of the author(s) should be legal.
It's not settled law so I'm not sure how that's clear to you.
It actually depends on evilness of the company. Elsevier is just less evil that Zuckerberg and Meta, while publishers are even less problematic. I dont think there is anything funny in that.
Or anything to defend on Meta. If they go out of business, humanity profits.
Elsevier is shitty to people doing stuff that (imo) should be allowed. Meta is making money doing the same thing and not getting the same shittiness from Elsevier.
Elsevier at least works within the (admittedly broken) system, Meta does not.
When you use millions of copyrighted materials to bundle together to produce a commercial product, I wouldn’t call that a fair use. Especially when licensing of such material doesn’t explicitly allow that, the material wasn’t even purchased on consumer markets and your commercial product may be a competitor/analogue to the copyrighted material.
Not even going to all GPL stuff, that in a better world should have screwed all the slop companies
The enemy of my enemy, and all that.
I'm not on Elsevier's side, but I still think it's bullshit that giant companies are allowed to do things at a scale that I'd go to prison for.
That's always going to be true for the Capitalist class.
And yet I continue to rage against the dying of the light.
"Funny" is how dishonest snipes are framed. It such a common trope of internet quips, it's wearing me out. Can we please try to just format our disagreements without the snideness?
[dead]
Such a garbage take. This is not a parody or a critique. Mark Zuckerberg is not Weird Al Yankovic.