> while being only marginally better.
It's only marginally better at the things it's actually comparable on. A\ models are MUCH better at many more things, e.g. things Kimi/etc. didn't distill.
For those things the difference is like a cliff.
That's a baseless claim that borderline reads like shilling. Do you have any proof of what you wrote there?