This is an interesting test for LLMs.
- ChatGPT was able to solve it in 6 seconds.
- Opus exhausted my entire day’s token limit on the problem & didn’t solve it.
Is it? Or has ChatGPT read the answer before?
As an end user, does it even matter?
You just want the answer.
It matters if you're going to call it an "interesting test".