TL;DR
EMMA is a comprehensive benchmark for evaluating concept erasure techniques in text-to-image models across multiple dimensions, revealing their limitations and biases.
Contribution
It introduces EMMA, a new benchmark with diverse metrics and challenging scenarios to rigorously assess concept erasure methods.
Findings
Existing methods struggle with implicit prompts and visually similar non-target concepts.
Some methods amplify gender and ethnicity bias after erasure.
EMMA provides a socially aware analysis of concept erasure techniques.
Abstract
The widespread adoption of text-to-image (T2I) generation has raised concerns about privacy, bias, and copyright violations. Concept erasure techniques offer a promising solution by selectively removing undesired concepts from pre-trained models without requiring full retraining. However, these methods are often evaluated on a limited set of concepts, relying on overly simplistic and direct prompts. To test the boundaries of concept erasure techniques, and assess whether they truly remove targeted concepts from model representations, we introduce EMMA, a benchmark that evaluates five key dimensions of concept erasure over 13 metrics. EMMA goes beyond standard metrics like image quality and time efficiency, testing robustness under challenging conditions, including indirect descriptions, visually similar non-target concepts, and potential gender and ethnicity bias, providing a socially…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
