I wonder how many cycles of train->extract->train->extract->... you can do before most of your output will be hallucinations.

It would be an expensive experiment to perform.

I wonder how you could do it more efficiently?