Human user usage data is probably a tiny contribution to improvement of the models--it's mostly RL on environments