Hacker News

> That's not what I got.

My Opus vs your Opus, which is smarter?!

LLMs can't access the training data that's less than the statistically most common token, so they use a random jitter.

With that randomness comes statistically irrelevant results.