If the data is opensource on github, then in my opinion it should be fair game.

IMO this is unfair for GPL or similarly licensed code.

Seems ok for MIT like licensed code though

It's totally fair to use GPL code, it just means all the models built by Anthropic, OpenAI, etc. using GPL-licensed source are themselves bound by the GPL. Plus, any works created downstream using those AI tools.

We're on the verge of a golden age of software as soon as someone finds a court with courage.

Ah, you have much more faith in the legal system than I do. It's nice to dream, though.

I think AI will create an open source dark age. Gradually, we'll see a lot less new good open source code. A gradual shift back to the proprietary world. Simmilar to the 1950-1990 period.

Things being public should not be enough. just because someone leaked your medical information to the public via a data breach should not make it fair game. There should be some rules.

I feel that's a false dichotomy. The code on github is freely available for people to read and learn from, leaked medical data isn't.

I feel that's a flase dichotomy. The code visible on github is freely available for anyone to read and learn from.