We need more coding benchmark score. Not sure that winning terminalbench 2.1 alone is a clear win over Fable/Mythos yet.
But they are the only ones who can benchmark, so the best and only benchmark will be the one where they win. It's just business baby.
But they are the only ones who can benchmark, so the best and only benchmark will be the one where they win. It's just business baby.