Mass Concept Erasure in Diffusion Models with Concept Hierarchy
Jiahang Tu, Ye Li, Yiming Wu, Hanbin Zhao, Chao Zhang, Hui Qian

TL;DR
This paper introduces a hierarchical concept erasure method for diffusion models that groups related concepts to improve efficiency and preserves generation quality, especially for supertype concepts, using a novel low-rank adaptation technique.
Contribution
It proposes a supertype-subtype hierarchy for concept erasure and introduces SuPLoRA, a low-rank adaptation method that maintains generation quality during concept suppression.
Findings
Group-wise concept erasure improves efficiency.
SuPLoRA mitigates degradation of supertype generation.
New benchmark challenges models to erase diverse concepts.
Abstract
The success of diffusion models has raised concerns about the generation of unsafe or harmful content, prompting concept erasure approaches that fine-tune modules to suppress specific concepts while preserving general generative capabilities. However, as the number of erased concepts grows, these methods often become inefficient and ineffective, since each concept requires a separate set of fine-tuned parameters and may degrade the overall generation quality. In this work, we propose a supertype-subtype concept hierarchy that organizes erased concepts into a parent-child structure. Each erased concept is treated as a child node, and semantically related concepts (e.g., macaw, and bald eagle) are grouped under a shared parent node, referred to as a supertype concept (e.g., bird). Rather than erasing concepts individually, we introduce an effective and efficient group-wise suppression…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Generative Adversarial Networks and Image Synthesis · Topic Modeling
