for recent books, they could buy digital version of the books and use them for training, though.

I was wondering about this, but digital versions are typical DRM-encumbered and actually a license (not a true purchase) whose terms probably don't allow this. The court's decision was that training is fair use, but in practice, it seems many avenues are blocked.

It reminds of the theoretically public beaches that are blocked off by privately owned land.

DRM is irrelevant. That's only if you want to efficiently extract the text.

If you point a camera at an ebook reader with a little motor to tap the screen, "next" that's still easier than scanning physical books.

The reason why companies aren't using ebooks is because all the publishers and ebook companies make you click through a license stating that "this book for personal use" (paraphrased).