Efficiency gains can be used to make existing models more profitable, or to make new larger and more intelligent models.

Some yes, others no. Distillation and quantization can't be used to make new base models since they require a preexisting one.

it enables models larger than was previously possible.

No because the base model from which the distilled or quantized models are derived is larger.