I'm impressed with the Arc-AGI-2 results - though readers beware... They achieved this score at a cost of $13.62 per task.
For context, Opus 4.6's best score is 68.8% - but at a cost of $3.64 per task.
I'm impressed with the Arc-AGI-2 results - though readers beware... They achieved this score at a cost of $13.62 per task.
For context, Opus 4.6's best score is 68.8% - but at a cost of $3.64 per task.