Hacker News

ahoog42 21 hours ago [ - ]

at what point do model providers optimize for the "pelican riding a bicycle" test so they place well on Simon's influential benchmark? :-)

hansonkd 21 hours ago [ - ]

They almost certainly are, even if unknowingly, because HN and all blogs get piped continuously into all models' training corpus.

simonw 21 hours ago [ - ]

See https://simonwillison.net/2025/Nov/13/training-for-pelicans-...

mudkipdev 17 hours ago [ - ]

Why is the assumption that they trained for a pelican on a bicycle, rather than running RL for all kinds of 'generate an SVG' tasks?

simonw 15 hours ago [ - ]

Gemini did exactly that, and boasted about it at launch: https://x.com/JeffDean/status/2024525132266688757

acchow an hour ago [ - ]

That post doesn't say anything about training for SVG generation

simonw an hour ago [ - ]

https://blog.google/innovation-and-ai/models-and-research/ge...

> Code-based animation: 3.1 Pro can generate website-ready, animated SVGs directly from a text prompt. Because these are built in pure code rather than pixels, they remain crisp at any scale and maintain incredibly small file sizes compared to traditional video.