Any quick impressions of o3 vs o1? We've got one inference in our product that only o1 has seemed to handle well, wondering if o3 can replace it.

They are replacing o1 with o3 in the UI, at least for me, so they must be pretty confident it is a strict improvement.