Hacker News

isomorphic_duck 21 hours ago

If Claude Mythos and Fable 5 are the same underlying model, just with different safeguards, I fail to see how TerminalBench has them at different scores.

sothatsit 20 hours ago

Refusals, presumably.