5.4 thinking says "Just right of center, immediately to the right of the HAM RADIO shack. Look on the dirt path there: the raccoon is the small gray figure partly hidden behind the woman in the red-and-yellow shirt, a little above the man in the green hat. Roughly 57% from the left, 48% from the top."
(I don't think it's right).
I tried
> please add a giant red arrow to a red circle around the raccoon holding a ham radio or add a cross through the entire image if one does not exist
and got this. I'm not sure I know what a ham radio looks like though.
https://i.ritzastatic.com/static/ffef1a8e639bc85b71b692c3ba1...
Also, the raccoon it circled isn't in the original.
I love how perfectly this captures the difficulties of using generative AI for detection tasks.
Oh god yes, I've been trying to make an LLM-assisted Magic: The Gathering card scanner... it's been a hell of a time trying to get it to just OCR card names well...
Why would you use an LLM for OCR?
Because apparently that's what programming is and can only be these days...
Indeed. I suppose one way to ensure you can find Waldo in any image is to add it yourself.
That's excellent. I added it to my post: https://simonwillison.net/2026/Apr/21/gpt-image-2/#update-as...
Hilarious. I tried and got the same thing.
There was a very large bear in the first image; when asked to circle the raccoon, it just turned the bear into a giant raccoon and circled it.