Mining Data Impressions from Deep Models as Substitute for the Unavailable Training Data
Gaurav Kumar Nayak, Konda Reddy Mopuri, Saksham Jain, Anirban, Chakraborty

TL;DR
This paper introduces 'Data Impressions', synthetic data extracted from pretrained deep models' parameters, enabling various tasks like domain adaptation, continual learning, and adversarial robustness without access to original training data.
Contribution
It proposes a novel method to generate proxy training data from model parameters, facilitating multiple applications in data privacy scenarios.
Findings
Effective in unsupervised domain adaptation
Enhances continual learning performance
Generates data-free universal adversarial perturbations
Abstract
Pretrained deep models hold their learnt knowledge in the form of model parameters. These parameters act as "memory" for the trained models and help them generalize well on unseen data. However, in absence of training data, the utility of a trained model is merely limited to either inference or better initialization towards a target task. In this paper, we go further and extract synthetic data by leveraging the learnt model parameters. We dub them "Data Impressions", which act as proxy to the training data and can be used to realize a variety of tasks. These are useful in scenarios where only the pretrained models are available and the training data is not shared (e.g., due to privacy or sensitivity concerns). We show the applicability of data impressions in solving several computer vision tasks such as unsupervised domain adaptation, continual learning as well as knowledge…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsKnowledge Distillation
