> There's a common rebuttal to this, and I hear it constantly. "Just wait," people say. "In a few months, in a year, the models will be better. They won't hallucinate. They won't fake plots. The problems you're describing are temporary."

To some extent, the reason models will get better is because companies will hire PhDs to train them on increasingly complex problems.

The problem is that more complex problems take longer to train, more time to test, require more compute, and are harder to verify. This is why “just make it bigger” is a losing proposition imo.

A lot of what I just said is also true for RLVR.