Dreaming to Distill: Data-free Knowledge Transfer via DeepInversion
Hongxu Yin, Pavlo Molchanov, Zhizhong Li, Jose M. Alvarez, Arun, Mallya, Derek Hoiem, Niraj K. Jha, Jan Kautz

TL;DR
DeepInversion is a novel method for synthesizing realistic images from trained neural networks without using real data, enabling data-free knowledge transfer, pruning, and continual learning.
Contribution
We propose DeepInversion, a technique to generate high-fidelity images from trained models without access to original training data, facilitating data-free neural network applications.
Findings
Synthesized images are highly realistic and diverse.
Effective for data-free network pruning and knowledge transfer.
Enables continual learning without real data.
Abstract
We introduce DeepInversion, a new method for synthesizing images from the image distribution used to train a deep neural network. We 'invert' a trained network (teacher) to synthesize class-conditional input images starting from random noise, without using any additional information about the training dataset. Keeping the teacher fixed, our method optimizes the input while regularizing the distribution of intermediate feature maps using information stored in the batch normalization layers of the teacher. Further, we improve the diversity of synthesized images using Adaptive DeepInversion, which maximizes the Jensen-Shannon divergence between the teacher and student network logits. The resulting synthesized images from networks trained on the CIFAR-10 and ImageNet datasets demonstrate high fidelity and degree of realism, and help enable a new breed of data-free applications - ones that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Dreaming to Distill: Data-Free Knowledge Transfer via DeepInversion· youtube
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Advanced Neural Network Applications · Generative Adversarial Networks and Image Synthesis
MethodsBatch Normalization
