Hacker News

film42 4 days ago [ - ]

Everyone wants to go the agent route until the agent messes up once after working 99 times in a row. "Why did it make a silly mistake?" We don't know. "Well, let's put a few more guard rails around it." Sounds good... back to "workflows."

film42 4 days ago [ - ]

"But what about having another agent that quality controls your first agent?"

You should watch the CDO-squared scene from the Big Short again.

dhorthy 4 days ago [ - ]

THIS so much. People are like "why human supervision when we can have agent supervsion" and always respond

> look if you don't trust the LLM to make the thing right in the first place, how are you gonna PROBABLY THE SAME LLM to fix it?

yes I know multiple passes improves performance, but it doesn't guarantee anything. for a lot of tool you might wanna call, 90% or even 99% accuracy isn't enough

dhorthy 4 days ago [ - ]

Yup