Comprehensive Assessment and Analysis for NSFW Content Erasure in Text-to-Image Diffusion Models
Die Chen, Zhiwen Li, Cen Chen, Xiaodan Li, Jinyan Ye

TL;DR
This paper systematically evaluates 11 state-of-the-art concept erasure methods for NSFW content in text-to-image diffusion models, analyzing their effectiveness across multiple perspectives to guide safer deployment.
Contribution
It provides the first comprehensive benchmark and analysis of concept erasure techniques for NSFW content in T2I diffusion models, including evaluation metrics and insights.
Findings
Certain erasure methods effectively reduce NSFW content but may impact image quality.
Explicit prompts influence the robustness of NSFW content erasure.
Evaluation reveals trade-offs between erasure effectiveness and semantic preservation.
Abstract
Text-to-image (T2I) diffusion models have gained widespread application across various domains, demonstrating remarkable creative potential. However, the strong generalization capabilities of these models can inadvertently lead them to generate NSFW content despite efforts to filter such content from the training dataset, posing risks to their safe deployment. While several concept erasure methods have been proposed to mitigate this issue, a comprehensive evaluation of their effectiveness remains absent. To bridge this gap, we present the first systematic investigation of concept erasure methods for NSFW content and its sub-themes in text-to-image diffusion models. At the task level, we provide a holistic evaluation of 11 state-of-the-art baseline methods with 14 variants. Specifically, we analyze these methods from six distinct assessment perspectives, including three conventional…
Taxonomy
Topics: Computational and Text Analysis Methods · Text and Document Classification Technologies · Ideological and Political Education
Methods: Diffusion
