Effectiveness of Arbitrary Transfer Sets for Data-free Knowledge Distillation
Gaurav Kumar Nayak, Konda Reddy Mopuri, Anirban Chakraborty

TL;DR
This paper demonstrates that arbitrary, unrelated datasets can effectively be used for data-free knowledge distillation if they are balanced across target classes, simplifying the process compared to complex synthetic data methods.
Contribution
It introduces a simple approach using arbitrary datasets for data-free knowledge distillation and validates its effectiveness across multiple benchmark datasets.
Findings
Arbitrary datasets can effectively replace original training data in knowledge distillation.
Balanced class distribution in the transfer set is crucial for success.
The approach simplifies data-free distillation without complex generative models.
Abstract
Knowledge Distillation is an effective method to transfer the learning across deep neural networks. Typically, the dataset originally used for training the Teacher model is chosen as the "Transfer Set" to conduct the knowledge transfer to the Student. However, this original training data may not always be freely available due to privacy or sensitivity concerns. In such scenarios, existing approaches either iteratively compose a synthetic set representative of the original training dataset, one sample at a time or learn a generative model to compose such a transfer set. However, both these approaches involve complex optimization (GAN training or several backpropagation steps to synthesize one sample) and are often computationally expensive. In this paper, as a simple alternative, we investigate the effectiveness of "arbitrary transfer sets" such as random noise, publicly available…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsKnowledge Distillation
