Zero-shot Knowledge Transfer via Adversarial Belief Matching
Paul Micaelli, Amos Storkey

TL;DR
This paper introduces a data-free adversarial method for transferring knowledge from a teacher to a student neural network, achieving competitive results without access to training data.
Contribution
It presents a novel zero-shot knowledge transfer technique using adversarial generation to match teacher predictions without data or metadata.
Findings
Successfully distills on simple datasets like SVHN.
Improves state-of-the-art in few-shot CIFAR10 distillation.
Introduces a metric for belief matching near decision boundaries.
Abstract
Performing knowledge transfer from a large teacher network to a smaller student is a popular task in modern deep learning applications. However, due to growing dataset sizes and stricter privacy regulations, it is increasingly common not to have access to the data that was used to train the teacher. We propose a novel method which trains a student to match the predictions of its teacher without using any data or metadata. We achieve this by training an adversarial generator to search for images on which the student poorly matches the teacher, and then using them to train the student. Our resulting student closely approximates its teacher for simple datasets like SVHN, and on CIFAR10 we improve on the state-of-the-art for few-shot distillation (with 100 images per class), despite using no data. Finally, we also propose a metric to quantify the degree of belief matching between teacher…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Domain Adaptation and Few-Shot Learning · Advanced Neural Network Applications
