All the model releases we've seen this year have only made incremental improvements in benchmarks.
This is the first release that feels like a significant step up in benchmark results.
Can anyone make an educated guess at what the secret sauce is, in terms of model architecture, that changed between 4.8 and Fable?