> If LLMs are not derivative works of the training data then why is so much training data needed?

If you went to school for 12-16 years, that's a lot of training. Does that mean anything you produce is a derivative work?