Hacker News

Show HN: I built an 11-LLM consensus engine to detect AI hallucination

6 points by jaquelinejaque 15 hours ago | 9 comments

Genuine question, how did you get to these 11 LLMs instead of 10 or 12? I'm interested in understanding how you did benchmark these 11 LLMs or whether it was an arbitrary ensemble you selected.

r0fl 6 hours ago [ - ]

Interesting idea

I get codex to use openrouter api and ask it to find 5 cheap but highly efficient LLMs at the task that km doing based on benchmarks and descriptions

I then run the query through all 5, get a markdown file for each in case I want to read through it later and have codex analyze and improve things based on those 5 outputs

It’s very easy and can scale to 11 or more LLMs with the same api

jaquelinejaque 3 hours ago [ - ]

[flagged]

jmtrevarton 8 hours ago [ - ]

Does the user set up API keys for those 11 LLMs or is API cost included in the product? Do you test for tool hallucination or only information hallucination?

jaquelinejaque 7 hours ago [ - ]

[flagged]

Lionga 11 hours ago [ - ]

Problem: We have AI Slop.

Solution: Lets make MORE AI Slop and hope it goes away somehow.

AI Psychosis in full swing.

jaquelinejaque 11 hours ago [ - ]

[flagged]

jaquelinejaque 3 hours ago [ - ]

[flagged]

jaquelinejaque 15 hours ago [ - ]

[flagged]