Both Opus 4.6 and GPT-5.3 one shot a Gameboy emulator for me. Guess I need a better benchmark.
How does that work? Does it actually generate low level code? Or does it just import libraries that do the real work?
I just one shot a Gameboy emulator by going to Github and cloning one of the 100 I can find.
How does that work? Does it actually generate low level code? Or does it just import libraries that do the real work?
I just one shot a Gameboy emulator by going to Github and cloning one of the 100 I can find.