Are you expecting a distilled model to be sufficiently powerful to capture the watermark? I wouldn’t.
Additionally, I don’t think the watermark has to be deterministic.