Which SOTA LLM fails at tic-tac-toe?

I don't know, but it's not a hard test, get the LLM to play a perfect game of tic-tac-toe against itself, look at the output and see if it goes wrong.