Hacker News

oofbaroomf 6 days ago [ - ]

I'm not really bullish on OpenAI. Why would they only compare with their own models? The only explanation could be that they aren't as competitive with other labs as they were before.

greenavocado 6 days ago [ - ]

See figure 1 for up-to-date benchmarks https://github.com/KCORES/kcores-llm-arena

(Direct Link) https://raw.githubusercontent.com/KCORES/kcores-llm-arena/re...

gizmodo59 6 days ago [ - ]

Apple compares against its own products most of the times.

kcatskcolbdi 6 days ago [ - ]

I don't mind what they benchmark against as long as, when I use the model, it continues to give me better results than their competition.

poormathskills 6 days ago [ - ]

Go look at their past blog posts. OpenAI only ever benchmarks against their own models.

oofbaroomf 6 days ago [ - ]

Oh, ok. But it's still quite telling of their attitude as an organization.

rvnx 6 days ago [ - ]

It's the same organization that kept repeating that sharing weights of GPT would be "too dangerous for the world". Eventually DeepSeek thankfully did something like that, though they are supposed to be the evil guys.