> while being only marginally better.

It's only marginally better in the things it's actually comparable to. A\ models are MUCH better in many more things; eg: things Kimi/etc. didn't distill.

For those things the difference is like a cliff.

That's a baseless claim that borderline reads like shilling. Do you have any proof of that you wrote there?