it enables models larger than was previously possible.

No because the base model from which the distilled or quantized models are derived is larger.