Hacker News

zone411 a year ago

Confabulations are decreasing with newer models. I tested confabulations based on provided documents (relevant for RAG) here: https://github.com/lechmazur/confabulations/. Note the significant difference between GPT-4 Turbo and GPT-4o.

christianqchung a year ago

Is 3% supposed to be significant? Or did you mean 4 Turbo and 4o mini?

zone411 a year ago

It is significant because of the other chart that shows MUCH lower non-response rates for GPT-4o.

tkgally a year ago

That’s very interesting! Thanks for the link.