Not dumb at all. It's a whole field of active research - Speculative Decoding. A recent paper goes one level deeper with Speculative Speculative Decoding - https://arxiv.org/abs/2603.03251

Is model distillation also related?

Oh man awesome! I’m so S-M-R-T

Compression is such an interesting field