Hacker News

cyanydeez 7 hours ago [ - ]

likely the small model makes whatever fuzzer they designed to poke the gpus much faster optimizations.

they seem to think it scales up because theyre shortening the stack.