Unified Concept Editing in Diffusion Models

Rohit Gandikota; Hadas Orgad; Yonatan Belinkov; Joanna Materzy\'nska,; David Bau

arXiv:2308.14761·cs.CV·October 25, 2024

Unified Concept Editing in Diffusion Models

Rohit Gandikota, Hadas Orgad, Yonatan Belinkov, Joanna Materzy\'nska,, David Bau

PDF

Open Access 1 Repo 1 Video

TL;DR

Unified Concept Editing (UCE) offers a single, training-free approach to simultaneously address bias, copyright, and offensive content issues in text-to-image diffusion models, improving safety and scalability.

Contribution

The paper introduces UCE, a novel closed-form, training-free method for concurrent concept editing in diffusion models, unifying multiple safety-related modifications.

Findings

01

Effective simultaneous debiasing, style erasure, and content moderation.

02

Scalable to multiple concurrent edits in diffusion models.

03

Outperforms prior methods in efficacy and scalability.

Abstract

Text-to-image models suffer from various safety issues that may limit their suitability for deployment. Previous methods have separately addressed individual issues of bias, copyright, and offensive content in text-to-image models. However, in the real world, all of these issues appear simultaneously in the same model. We present a method that tackles all issues with a single approach. Our method, Unified Concept Editing (UCE), edits the model without training using a closed-form solution, and scales seamlessly to concurrent edits on text-conditional diffusion models. We demonstrate scalable simultaneous debiasing, style erasure, and content moderation by editing text-to-image projections, and we present extensive experiments demonstrating improved efficacy and scalability over prior work. Our code is available at https://unified.baulab.info

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

rohitgandikota/unified-concept-editing
pytorchOfficial

Videos

Unified Concept Editing in Diffusion Models· youtube

Taxonomy

TopicsMultimodal Machine Learning Applications · Generative Adversarial Networks and Image Synthesis · Internet Traffic Analysis and Secure E-voting

MethodsDiffusion