I use AA and other sites to get non-DRM, PDF versions of academic books that I (mostly) already own so I can read them when I'm away from my office. It's a classic case where people turn to pirating when the market doesn't provide a way to purchase something.

Same thing with movies. Ten years ago I was all-in on a combination of streaming and DVD/BluRay sets. The market has completely collapsed for me with region locking and overly aggressive DRM. So, I've started pirating those again as well when it's not possible to get through another route.

Sure, but the difference here is the pirate is claiming it's "their data" and asking for donations.

Well, it is their data.

The word "their" is overloaded, it could mean "thing I have the legal right to", or, "thing I have in my possession right now".

The latter condition is clearly true. It's their data.

If you pretend the other definitions of possession don't exist and claim "aktually it's not theirs they don't have rights to it" then that's on you for faking an incomplete understanding of language.

Well, but if it’s the latter definition, then the AI didn’t train on their data, since the companies took possession of that data before doing a training run.

It’s only the former definition that would allow an AI model to have been trained on someone else’s data

> It’s only the former definition that would allow an AI model to have been trained on someone else’s data

There are yet more definitions of "theirs". For example, data whose provenance can be traced back to Anna's Archive.

So the data is legally owned by the book authors, possessed by Anna's Archive, and downloaded for training usage by the AI companies. Every person in that chain could, linguistically speaking, correctly refer to the data as "theirs", or refer to the data of a different entity as "theirs".

It's their servers sure, but if you download something under a license that doesn't grant you ownership, then it isn't yours.

You are being granted a license to use the data.

Yes, exactly, if you ignore all definitions of "yours" that involve possession then it isn't "yours".

But no one else is obligated to ignore the definitions of words that you're choosing to ignore, so the rest of us will go on saying it's their data.

If you steal my car, no who knows it's stolen would say it's "yours".

We're not talking abstract language concepts, this is a specific case. The data was taken without license/rights/approval. It's stolen. AA calling it "our data" is disingenuous. Legally it isn't theirs. While you could use "ours"/"theirs" loosely in English, they knew it wasn't true in a legal sense when publishing this.

Taking someone else's car illicitly is theft, because theft means taking with intent to deprive the rightful owner of it. Copying can never be theft, only moving can be theft, because only moving it could deprive the rightful owner of it. An illicit copy is merely copyright infringement or a breach of contract or various other concepts that are not theft despite people sometimes using that word as shorthand. It's YOUR illicit copy, not the rightful owner's illicit copy.

I didn't "steal" your passwords, I just "copied" them. I don't know what you're getting so upset about, you still have your list of passwords, and the fact that my changing all your accounts' passwords rendered that list worthless did nothing to move it.

It means whatever is convenient. If you are looking to monetize knowledge you would use it like "your car", half way your books are just books you've purchased a copy of, at the other end your car is now mine.

I found an abandoned bicycle 10 years ago. I have since replaced nearly all parts of it. I would give it back if you can prove it is yours but who owns the bicycle of theseus is more of an opinion.

I refer to it as my bicycle.

> If you steal my car, no who knows it's stolen would say it's "yours".

The chop shop well might.

Or, if I steal your car, and then go on to use it daily for the next 10 years, at some point everyone I know will refer to it as "my" car even if they're all entirely aware it was stolen.

> they knew it wasn't true in a legal sense when publishing this

I'm not sure why you're expecting the operators of a pirate site to use legally rigorous terms to refer to themselves in a blog post. This is an error in your expectations, not their terminology.

> The data was taken without license/rights/approval. It's stolen.

That's incorrect. A license violation isn't theft. Theft deprives others of their property, that's not what's going on here. Intellectual property is a fictional "ownership" that provides value to society, but it is much newer and different than the actual ownership of property.

No one actually owns a collection of words or ideas or thoughts.

Yet the main holders of this position were caught saying "our data". Don't you see the irony?

Guess what, the AI companies training their models aren't going to include themselves in the "rest of us"

The AI companies training their models are going to refer to it as their own data, once it's on their servers.

"but if you download something under a license that doesn't grant you ownership, then it isn't yours."

Possession is 9/10 of the law - if you have a copy, you have possession, and thus you have SOMETHING and LEGALLY it is considered yours (now whether you legally obtained it is a different story and THAT is where charges stem from.)

Random nit, the original saying was "possession is 9 points of the law", attributes that strengthened legal claims, rather than a percentage. Things like possession, good lawyer, money, patience, witnesses, for which if you had the object in your possession were likely to be in your favor.

Their data about not their work

This was the whole premise of Steam. Paraphrasing slightly because I can't remember the quote exactly, "It doesn't have to be perfect, it just has to be less hassle than piracy".

Even Youtube is no longer less hassle than piracy now.

Spotify is always my example. Spotify (and Apple Music I assume) is far more convenient, for a modest price, than pirating music.

It’s a shame the TV and movie people can’t seem to learn this. Most music is available on Spotify and Apple and probably other places as well.

They toyed with exclusivity for a while and I’m sure there’s still some stuff that’s exclusive to one or the other, but any time I hear a song and look it up, it’s on Spotify. Done.

Such a contrast to the stupid game of figuring out which streaming service has the show I want.

Most of the music i listen to doesnt exist on Spotify and I think their business model is very predatory against artists. most artists cant pay their bills with Spotify fees, they just need to be on there to get visibility for their actual revenue streams.

I think a better example is bandcamp - it’s actually sustainable for artists and just as convenient as pirating. Plus you get to actually own what you pay for as opposed to Spotify controlling what you can / cant listen to.

Music is very different to TV and movies. You only watch a show or a movie once, maybe twice. And it costs much more to produce it.

The biggest difference there isn't production costs, but the physical costs of maintaining the giant library, in a way that is reasonable streamable at a good cost from any device, with many dubbings, and even video differences per version. Go see how many little differences are there in a random Pixar movie due to localization. The infrastructure per hour watched is relevant, and there's a lot of differences between one is willing to spend on something that is being watched hundreds of thousands of times today, and some 30 year old episode of a series nobody followed. It's a much different production than sending music files over.

Even with licensing costs at zero, the infra of Youtube, the closest thing to Spotify for video, is a very different beast. And I'd argue youtube doesn't go far enough.

This sounds reasonable, but it doesn't seem to reflect reality. The biggest reason that shows are region locked and/or removed from streaming sites are licensing deals, not technical reasons. Movie and TV production companies are the ones pushing for the region locks, and the ones selling limited distribution rights to streaming services.

So, while you are right that video streaming is much more costly than audio streaming, I think GP is overall more correct about the reasoning being production costs rather than anything to do with distribution.

Maybe there's an opportunity for a media host to farm out data for preservation by clients (end users' computers) - what I'm thinking is torrent essentially, where the data-unit is a scene (or a series of frames between n key-frames). Clients get access to that show if they agree to store m chunks. The media repo can sell access whilst only keeping a copy in cold-storage because you can 'popcorn time' the show from the pool of user-clients.

Reduced hot-storage, increased playlist. Sort of media communism but the capitalists still hold the keys?

This can never be legal. When I worked in media streaming the copyright owners were very specific about what we were allowed to store, and wouldn't allow unencrypted files to be transmitted to any other companies.

[dead]

> Spotify is always my example. Spotify (and Apple Music I assume) is far more convenient, for a modest price, than pirating music.

streaming services do provide some conveniences over manually managing one's own library of music. i feel like "far more" is a sales pitch argument more than something that describes reality (ignoring whether you pirate or legally acquire digital music). i recently cancelled my streaming music service subscription and returned to manually managing my music. i spend maybe one day a week shuffling music on and off of my phone according to what i want to listen to in the moment. i don't really miss being able to call up any song in the world at any point - i make a note to add it to my phone next time i sync and then move on. if i simply have to play something that's not currently on my phone, i can usually find it on bandcamp or youtube without having to pay for a stream or two.

i know it's not for everybody (and trust me, apple doesn't make it particularly easy to do compared to signing up for Apple Music), but it's really not much work to manage your own music and doing so comes with some benefits you forget about when you assume you can and should have instantaneous, frictionless access to most recorded music.

Except that Spotify is now becoming enshittified (battery and UI). When I have to think too much to attempt to use a UI, its time to find alternatives.

As opposed to streaming video services, which, aside from the content they provide, have been shit from day one.

While the web UIs suck compared to local media players, they work well enough that I can cope.

But most services restrict 4K (and at least historically 1080p) web playback, even on Windows with a GPU that supports top-tier hardware DRM and an HDCP display.

My desktop display is a recent 55" LG OLED smart TV, and the streaming service apps on the TV work fine when my attention is devoted to whatever I'm watching, even if they tend to be slightly shittier than the already mediocre web UIs.

But when task switching or multitasking, my only options are reduced video quality, borrowing or purchasing a physical copy if available, or piracy.

Given how quickly everything shows up on public torrent trackers, I struggle to understand why the 4K limitations remain in place, as it obviously doesn't stop whoever uploads the torrents, and there has to be a vanishingly small number of paying customers who'd prefer to crack DRM locally or record HDMI instead of simply downloading the torrent.

Do streaming services get kickbacks from smart device vendors?

IIRC the interview that quote was from came with the story - Russia was seen as a lost cause by the game industry, there was so much piracy that nobody even bothered trying to give legitimate ways to purchase, why invest in distribution when they’ll just pirate? Now of course Steam does heathy business there so that’s obviously not true. But indicates writing off piracy is a self fulfilling prophecy

Not anymore they don't.

Putin's 3 day special military operation has been going on for 4 year and 3 months, btw.

Steam is still accessible in Russia btw. Sometimes it's spotty, but it's because of Russia's own restrictions, Valve itself is happy to keep doing business there.

Does Valve just have an internal Russian entity that processes with a domestic payment processor, then?

All of the international payment processors (ie, anyone piggybacking off Visanet) are in compliance with the sanctions.

> We think there is a fundamental misconception about piracy. Piracy is almost always a service problem and not a pricing problem. If a pirate offers a product anywhere in the world, 24 x 7, purchasable from the convenience of your personal computer, and the legal provider says the product is region-locked, will come to your country 3 months after the US release, and can only be purchased at a brick and mortar store, then the pirate’s service is more valuable.

https://www.escapistmagazine.com/Valves-Gabe-Newell-Says-Pir...

Aren't most piracy services free though nowadays? This quote is at least referencing pirates that sell the pirated content.

Original interview with Gabe: https://youtube.com/watch?v=EQweFurRz4g

> Even Youtube is no longer less hassle than piracy now.

YouTube premium is hassle?

I think he means that you can’t watch regular videos on YouTube unless you use a IP that is easily traceable to a subscriber or a YouTube account that requires everything short of a DNA sample to be valid.

I don't see any hassle with youtube, but I'm willing to pay.

I do see hassle on things like disney and iplayer, which put now put adverts for shows I don't want to watch in front of Rivals. It's fortunately very rare that happens (on Disney), but its getting close to what I did when Amazon brought that in, and cancelled my subscription. Just like I stopped buying DVDs when they brought adverts in.

I wouldn't have any moral problem in downloading Rivals from piratebay though, as far as I'm concerned I'm paying for it.

But sometimes though there's no option to buy the thing. I want to buy the audio version of "a stitch in time" by Andrew Robinson (Garak from Star Trek).

It's not available in my country on audible -- only the German translation.

I haven't acquired it via other means yet, I'm still on the look out for another supplier which will take my money, and if I can trust that's a legitimate supplier so at least some of my money goes to the copyright holder (and thus pays for the people that create it)

I don't have a CD player so not much use, but technically it is available for £142 from "Paper Cavalier UK". That's second hand, the creator won't make any money from me doing that.

To my mind if someone won't "shut up and take my money", it's acceptable to acquire via another means.

since youtube premium and various methods to skip ads now even Joe rogan who has 200+ million dollars does ad reads directly in video.

That’s not a problem with YouTube, that’s a problem with the content creator. YouTube Premium accounts actually pay out more per watch than free users, and YouTube also provides a Skip Ahead button that will appear at the start of most ad reads (it’s a bit hit or miss, I think it relies on data from other people scrubbing past them).

YouTube could ban ad reads that aren't tagged, then Premium accounts could get no ads. I guess they're worried that tags would leak and allow 3rd party solutions (like SponsorBlock) to skip more easily.

YouTube could not give less of a shit about people skipping in-video ads, since they don't get paid for those anyway.

It's all about playing the incentive structure. When the party who can stop you from doing something is different from the party who wants to stop you from doing it, nobody will stop you from doing it.

sure but if youtube wanted to, they could force the creators to tag these sections themselves so they are 100% accurate and have an option for the paying customer to skip these automatically. it is within their power

You might be interested in the SponsorBlock[1] browser extension for Firefox and Chromium based browsers. It deals with this issue, and is open source.

[1] https://github.com/ajayyy/SponsorBlock

I love SponsorBlock so much.

  >You've saved people from 21,262 segments (5d 18h 50.7 minutes of their lives)
  >
  >You've skipped 3522 segments (1d 5h 17.4 minutes)
Not just for skipping ads, but also pointless filler like intros and engagement reminders.

I hope someone makes an AI-Block addon, to filter out slop channels based on the same crowd sourcing principle. It's gotten so bad I rarely venture beyond that channels I'm already subscribed to, because those are pre-sloppocalypse.

The guy got his start on NewsRadio and I always wonder how much that influenced his path today.

https://en.wikipedia.org/wiki/NewsRadio