I believe it is because of RL you are no longer limited by training data as you generate it during learning on the fly so benchmark driven learning could scale with compute
they also seem to assume that everyone will use AI from them in the future, especially with new "pulse" combined with ads. scaling this will need much more compute.
is this reasonable? I'm not convinced, but this is how I believe it's their reasoning