Texts in Gutenberg have 20GB, and full Wikipedia (English texts) have 80-110GB.
So to LLM-generate 6.5GB of tiny stories is quite a permutation in action :)
Texts in Gutenberg have 20GB, and full Wikipedia (English texts) have 80-110GB.
So to LLM-generate 6.5GB of tiny stories is quite a permutation in action :)