I truthfully cannot think of a single model that satisfies your criteria.
And if we wait for the the internet to be wholly eaten by AI, if we accept perfect as the enemy of good, then we'll have nothing left to cling to.
> And the question is also pretty clear: did $company steal other peoples work?
Who the hell cares? By the time this is settled - and I'd argue you won't get a definitive agreement - the internet will be won by the hyperscalers.
Accept corporate gifts of AI, and keep pushing them forward. Commoditize. Let there be no moat.
There will be infinite synthetic data available to us in the future anyway. And none of this bickering will have even mattered.