TRACE: Trajectory-Constrained Concept Erasure in Diffusion Models
Finn Carter

TL;DR
TRACE introduces a theoretically grounded and effective method for erasing specific concepts from diffusion models, ensuring undesirable content suppression while maintaining high-quality image generation.
Contribution
The paper presents a novel concept erasure technique combining formal theoretical conditions with a trajectory-aware fine-tuning process for diffusion models.
Findings
Achieves state-of-the-art concept removal performance
Outperforms existing methods like ANT, EraseAnything, and MACE
Maintains high generative quality after erasure
Abstract
Text-to-image diffusion models have shown unprecedented generative capability, but their ability to produce undesirable concepts (e.g.~pornographic content, sensitive identities, copyrighted styles) poses serious concerns for privacy, fairness, and safety. {Concept erasure} aims to remove or suppress specific concept information in a generative model. In this paper, we introduce \textbf{TRACE (Trajectory-Constrained Attentional Concept Erasure)}, a novel method to erase targeted concepts from diffusion models while preserving overall generative quality. Our approach combines a rigorous theoretical framework, establishing formal conditions under which a concept can be provably suppressed in the diffusion process, with an effective fine-tuning procedure compatible with both conventional latent diffusion (Stable Diffusion) and emerging rectified flow models (e.g.~FLUX). We first derive a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Domain Adaptation and Few-Shot Learning · Face recognition and analysis
MethodsDiffusion
