EMMA: Concept Erasure Benchmark with Comprehensive Semantic Metrics and Diverse Categories

Lu Wei; Yuta Nakashima; Noa Garcia

arXiv:2512.17320·cs.CV·April 1, 2026

EMMA: Concept Erasure Benchmark with Comprehensive Semantic Metrics and Diverse Categories

Lu Wei, Yuta Nakashima, Noa Garcia

PDF

1 Repo

TL;DR

EMMA is a comprehensive benchmark for evaluating concept erasure techniques in text-to-image models across multiple dimensions, revealing their limitations and biases.

Contribution

It introduces EMMA, a new benchmark with diverse metrics and challenging scenarios to rigorously assess concept erasure methods.

Findings

01

Existing methods struggle with implicit prompts and visually similar non-target concepts.

02

Some methods amplify gender and ethnicity bias after erasure.

03

EMMA provides a socially aware analysis of concept erasure techniques.

Abstract

The widespread adoption of text-to-image (T2I) generation has raised concerns about privacy, bias, and copyright violations. Concept erasure techniques offer a promising solution by selectively removing undesired concepts from pre-trained models without requiring full retraining. However, these methods are often evaluated on a limited set of concepts, relying on overly simplistic and direct prompts. To test the boundaries of concept erasure techniques, and assess whether they truly remove targeted concepts from model representations, we introduce EMMA, a benchmark that evaluates five key dimensions of concept erasure over 13 metrics. EMMA goes beyond standard metrics like image quality and time efficiency, testing robustness under challenging conditions, including indirect descriptions, visually similar non-target concepts, and potential gender and ethnicity bias, providing a socially…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

lobsterlulu/EMMA
github

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.