The whole point of the thesis is that because the cover image are very similar, therefore LLMs are bad at writing text?

I think it's that today's LLMs have access to poor/generic image generation models and people find it easier to ask ChatGPT or NanoBanana to make a cover instead of fine tuning a small SD model for the purpose.

The people in the FediVerse discussions have also looked at the book contents.

* https://mastodonapp.uk/@JdeBP/116788511790947929

* https://hachyderm.io/@ariels/116788498255660876