Erasing with Precision: Evaluating Specific Concept Erasure from Text-to-Image Generative Models
Masane Fuchi, Tomohiro Takagi

TL;DR
This paper introduces EraseEval, a comprehensive evaluation framework for assessing the effectiveness of concept erasure in text-to-image generative models, addressing limitations of subjective and inconsistent previous metrics.
Contribution
It proposes a novel, integrated evaluation method with three core criteria for more robust assessment of concept erasure techniques.
Findings
Baseline methods vary in effectiveness
Some methods fail to fully erase concepts
Future research directions are identified
Abstract
Studies have been conducted to prevent specific concepts from being generated from pretrained text-to-image generative models, achieving concept erasure in various ways. However, the performance evaluation of these studies is still largely reliant on visualization, with the superiority of studies often determined by human subjectivity. The metrics of quantitative evaluation also vary, making comprehensive comparisons difficult. We propose EraseEval, an evaluation method that differs from previous evaluation methods in that it involves three fundamental evaluation criteria: (1) How well does the prompt containing the target concept be reflected, (2) To what extent the concepts related to the erased concept can reduce the impact of the erased concept, and (3) Whether other concepts are preserved. These criteria are evaluated and integrated into a single metric, such that a lower score is…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications
