Adapt then Unlearn: Exploring Parameter Space Semantics for Unlearning in Generative Adversarial Networks
Piyush Tiwary, Atri Guha, Subhodip Panda, Prathosh A.P

TL;DR
This paper introduces 'Adapt-then-Unlearn,' a novel two-stage method for unlearning undesired features in pre-trained GANs while preserving sample quality, addressing privacy concerns without access to original training data.
Contribution
The paper presents the first high-fidelity GAN unlearning method leveraging parameter space semantics, combining adaptation and regularization to effectively remove undesired features.
Findings
Effective unlearning of undesired features demonstrated on multiple datasets.
Maintains high sample quality after unlearning.
Theoretical insights support the method's effectiveness.
Abstract
Owing to the growing concerns about privacy and regulatory compliance, it is desirable to regulate the output of generative models. To that end, the objective of this work is to prevent the generation of outputs containing undesired features from a pre-trained Generative Adversarial Network (GAN) where the underlying training data set is inaccessible. Our approach is inspired by the observation that the parameter space of GANs exhibits meaningful directions that can be leveraged to suppress specific undesired features. However, such directions usually result in the degradation of the quality of generated samples. Our proposed two-stage method, known as 'Adapt-then-Unlearn,' excels at unlearning such undesirable features while also maintaining the quality of generated samples. In the initial stage, we adapt a pre-trained GAN on a set of negative samples (containing undesired features)…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Adversarial Robustness in Machine Learning · Digital Media Forensic Detection
MethodsFocus
