I love your idea and would enjoy seeing the results of that controlled experiment.
I'm also interested in the broader impact of using LLMs in place of web search for general Q&A when we want 'to know things'. It's pretty clear the way LLMs are being used for knowledge acquisition now is often less accurate while 'feeling' more certain. Even if we set aside explicit hallucinations, I suspect it's still less accurate.