Would love to see benchmarks on cognition's FrontierCode