Is 3% supposed to be significant? Or did you mean 4 Turbo and 4o mini?

It is significant because of the other chart that shows MUCH lower non-response rates for GPT-4o.