Students Parrot Their Teachers: Membership Inference on Model Distillation
Matthew Jagielski, Milad Nasr, Christopher Choquette-Choo, Katherine, Lee, Nicholas Carlini

TL;DR
This paper investigates the privacy implications of model distillation, revealing that it offers limited protection against membership inference attacks and can be vulnerable even without direct queries on training data.
Contribution
The study introduces new membership inference attacks demonstrating that knowledge distillation provides limited privacy and can be compromised under certain conditions.
Findings
Distillation offers limited privacy protection.
Attacks succeed even without direct queries on training data.
Attacks are stronger when teacher and student data are similar or poisoned.
Abstract
Model distillation is frequently proposed as a technique to reduce the privacy leakage of machine learning. These empirical privacy defenses rely on the intuition that distilled ``student'' models protect the privacy of training data, as they only interact with this data indirectly through a ``teacher'' model. In this work, we design membership inference attacks to systematically study the privacy provided by knowledge distillation to both the teacher and student training sets. Our new attacks show that distillation alone provides only limited privacy across a number of domains. We explain the success of our attacks on distillation by showing that membership inference attacks on a private dataset can succeed even if the target model is *never* queried on any actual training points, but only on inputs whose predictions are highly influenced by training data. Finally, we show that our…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · Adversarial Robustness in Machine Learning · Explainable Artificial Intelligence (XAI)
MethodsKnowledge Distillation
