Would be interesting to see coding performance on SWE benchmarks.