Training by itself has repeatedly been ruled to be fair use, including for LLMs, and there isn't much debate left there.
The test for infringement is whether the output is transformative enough, and that is what cases like NYT v. OpenAI are arguing over.