Great idea. Very limited execution. If they release the source data and question set, I'll repeat with more LLMs to flesh out the findings.