>The biggest trap is the hallucinated citation

The "biggest problem" being the one thing that is trivial to verify against concrete databases is a bit convenient don't you think?

I think it's more likely that it makes mistakes evenly but the one thing that you are able to check with certainty is the only place you discover the errors.

I've made the same experience with programming AI. It is very convenient, but convenient doesn't mean unlikely. The universe appears to have given us a convenient thing here.