And the Chinese models rip IP just like everyone else before them. Your argument is moot.

This was a problem for 5+ years ago. Nobody cares or at least the majority voice does not care across the world. Cat is out of the bag and there is no way to put it back in.

EDIT: Worth noting that I have long held the belief that if you put data out on the public sidewalk that you should have low to no expectation that it’s IP. It’s how I think about Google Maps data for example. If they want to reap the benefits by not walking it off the a user login than they can feel the pain if folks use that information. Same applies for media that has been bought, Reddit comments or any other datasets.

> And the Chinese models rip IP just like everyone else before them.

The difference is the Chinese models return to the same system that they feed from (indirectly); people get access to model weights even if the entire model isn't open source. The same can't be said for OpenAI, Anthropic, Google etc (who also benefit from Chinese models and train on them).

Further, Chinese models are significantly cheaper and the comapnies aren't hostile to their customers.

> Worth noting that I have long held the belief that if you put data out on the public sidewalk that you should have low to no expectation that it’s IP.

Except your beliefs aren't the cornerstone of modern jurisprudence. Why are models able to reliably produce replicas of Ghibli movies which go well beyond any example you listed?

[flagged]