Receler: Reliable Concept Erasing of Text-to-Image Diffusion Models via   Lightweight Erasers

Chi-Pin Huang; Kai-Po Chang; Chung-Ting Tsai; Yung-Hsuan Lai; Fu-En; Yang; Yu-Chiang Frank Wang

arXiv:2311.17717·cs.CV·July 19, 2024·2 cites

Receler: Reliable Concept Erasing of Text-to-Image Diffusion Models via Lightweight Erasers

Chi-Pin Huang, Kai-Po Chang, Chung-Ting Tsai, Yung-Hsuan Lai, Fu-En, Yang, Yu-Chiang Frank Wang

PDF

Open Access 1 Repo

TL;DR

Receler introduces a lightweight method for reliably erasing specific concepts from text-to-image diffusion models, ensuring robustness against paraphrased prompts while preserving the model's ability to generate unrelated images.

Contribution

The paper proposes a novel lightweight Eraser with regularization and adversarial learning for reliable concept erasure in diffusion models, outperforming previous approaches.

Findings

01

Receler achieves superior concept erasure performance.

02

The method maintains image generation quality for non-target concepts.

03

Experiments validate robustness against paraphrased prompts.

Abstract

Concept erasure in text-to-image diffusion models aims to disable pre-trained diffusion models from generating images related to a target concept. To perform reliable concept erasure, the properties of robustness and locality are desirable. The former refrains the model from producing images associated with the target concept for any paraphrased or learned prompts, while the latter preserves its ability in generating images with non-target concepts. In this paper, we propose Reliable Concept Erasing via Lightweight Erasers (Receler). It learns a lightweight Eraser to perform concept erasing while satisfying the above desirable properties through the proposed concept-localized regularization and adversarial prompt learning scheme. Experiments with various concepts verify the superiority of Receler over previous methods.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

jasper0314-huang/Receler
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning in Healthcare

MethodsDiffusion