Great idea. Very limited execution. If they release the source data and question set, I'll repeat with more LLMs to flesh out the findings.
Great idea. Very limited execution. If they release the source data and question set, I'll repeat with more LLMs to flesh out the findings.