Adapt then Unlearn: Exploring Parameter Space Semantics for Unlearning   in Generative Adversarial Networks

Piyush Tiwary; Atri Guha; Subhodip Panda; Prathosh A.P

arXiv:2309.14054·cs.LG·February 13, 2025·1 cites

Adapt then Unlearn: Exploring Parameter Space Semantics for Unlearning in Generative Adversarial Networks

Piyush Tiwary, Atri Guha, Subhodip Panda, Prathosh A.P

PDF

Open Access

TL;DR

This paper introduces 'Adapt-then-Unlearn,' a novel two-stage method for unlearning undesired features in pre-trained GANs while preserving sample quality, addressing privacy concerns without access to original training data.

Contribution

The paper presents the first high-fidelity GAN unlearning method leveraging parameter space semantics, combining adaptation and regularization to effectively remove undesired features.

Findings

01

Effective unlearning of undesired features demonstrated on multiple datasets.

02

Maintains high sample quality after unlearning.

03

Theoretical insights support the method's effectiveness.

Abstract

Owing to the growing concerns about privacy and regulatory compliance, it is desirable to regulate the output of generative models. To that end, the objective of this work is to prevent the generation of outputs containing undesired features from a pre-trained Generative Adversarial Network (GAN) where the underlying training data set is inaccessible. Our approach is inspired by the observation that the parameter space of GANs exhibits meaningful directions that can be leveraged to suppress specific undesired features. However, such directions usually result in the degradation of the quality of generated samples. Our proposed two-stage method, known as 'Adapt-then-Unlearn,' excels at unlearning such undesirable features while also maintaining the quality of generated samples. In the initial stage, we adapt a pre-trained GAN on a set of negative samples (containing undesired features)…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Adversarial Robustness in Machine Learning · Digital Media Forensic Detection

MethodsFocus