The distillation you're talking about is about cutting the number of weights, it has nothing to do with extracting QAs from another model.
The distillation you're talking about is about cutting the number of weights, it has nothing to do with extracting QAs from another model.