Data Redaction from Pre-trained GANs

Zhifeng Kong; Kamalika Chaudhuri

arXiv:2206.14389·cs.LG·January 19, 2023

Data Redaction from Pre-trained GANs

Zhifeng Kong, Kamalika Chaudhuri

PDF

Open Access

TL;DR

This paper introduces post-training algorithms for redacting specific data from GANs, enabling the models to avoid generating undesirable samples efficiently without full retraining.

Contribution

The work presents novel post-editing methods for GANs to perform data redaction, distinct from data deletion, improving efficiency and effectiveness.

Findings

01

Algorithms outperform data deletion baselines

02

Redaction maintains high generation quality

03

Methods are computationally efficient

Abstract

Large pre-trained generative models are known to occasionally output undesirable samples, which undermines their trustworthiness. The common way to mitigate this is to re-train them differently from scratch using different data or different regularization -- which uses a lot of computational resources and does not always fully address the problem. In this work, we take a different, more compute-friendly approach and investigate how to post-edit a model after training so that it ''redacts'', or refrains from outputting certain kinds of samples. We show that redaction is a fundamentally different task from data deletion, and data deletion may not always lead to redaction. We then consider Generative Adversarial Networks (GANs), and provide three different algorithms for data redaction that differ on how the samples to be redacted are described. Extensive evaluations on real-world image…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Machine Learning in Healthcare · AI in cancer detection