Reliable and Efficient Concept Erasure of Text-to-Image Diffusion Models

Chao Gong; Kai Chen; Zhipeng Wei; Jingjing Chen; Yu-Gang Jiang

arXiv:2407.12383·cs.CV·October 29, 2024·1 cites

Reliable and Efficient Concept Erasure of Text-to-Image Diffusion Models

Chao Gong, Kai Chen, Zhipeng Wei, Jingjing Chen, Yu-Gang Jiang

PDF

Open Access 1 Repo 1 Models

TL;DR

This paper presents RECE, a fast and reliable method for erasing inappropriate concepts from text-to-image diffusion models in just 3 seconds, with minimal impact on the model's original capabilities.

Contribution

RECE introduces a novel closed-form approach for concept erasure that is both highly efficient and effective, requiring no additional fine-tuning.

Findings

01

Achieves concept erasure in 3 seconds

02

Maintains most of the original generation ability

03

Shows improved robustness against red-teaming tools

Abstract

Text-to-image models encounter safety issues, including concerns related to copyright and Not-Safe-For-Work (NSFW) content. Despite several methods have been proposed for erasing inappropriate concepts from diffusion models, they often exhibit incomplete erasure, consume a lot of computing resources, and inadvertently damage generation ability. In this work, we introduce Reliable and Efficient Concept Erasure (RECE), a novel approach that modifies the model in 3 seconds without necessitating additional fine-tuning. Specifically, RECE efficiently leverages a closed-form solution to derive new target embeddings, which are capable of regenerating erased concepts within the unlearned model. To mitigate inappropriate content potentially represented by derived embeddings, RECE further aligns them with harmless concepts in cross-attention layers. The derivation and erasure of new…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

charlesgong12/rece
pytorchOfficial

Models

🤗
ChaoGong/RECE
model· ♡ 5
♡ 5

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsBiomedical Text Mining and Ontologies · Image Retrieval and Classification Techniques · Topic Modeling

MethodsDiffusion