Hacker News

taf2 11 hours ago

I’m waiting to see results on deepswe - that benchmark really seemed accurate for opus and gpt 5.5…