"training good coding models" many would say that is a highly debatable statement, and some would say that is just flat out not true. Cursor has not trained a frontier model from scratch, what they did was take an already made (non-frontier) model and further trained it on their user data about coding outcomes from its coding agent. So, a form of distillation and RL.