Main takeaways:

- Dramatically improved coding accuracy

- Reliable handling of 1M-token context

- Much stronger instruction following