Semantic Surgery: Zero-Shot Concept Erasure in Diffusion Models
Lexiang Xiong, Chengyu Liu, Jingwen Ye, Yan Liu, Yuecong Xu

TL;DR
Semantic Surgery is a training-free, zero-shot method for concept erasure in diffusion models that improves completeness and locality without sacrificing image quality, enhancing safety in text-to-image generation.
Contribution
It introduces a novel, training-free framework that performs dynamic, zero-shot concept erasure directly on text embeddings, outperforming existing methods in robustness and quality.
Findings
Achieves 93.58 H-score in object erasure
Reduces explicit content to just 1 instance
Attains 8.09 H_a in style erasure without quality loss
Abstract
Concept erasure in text-to-image diffusion models is crucial for mitigating harmful content, yet existing methods often compromise generative quality. We introduce Semantic Surgery, a novel training-free, zero-shot framework for concept erasure that operates directly on text embeddings before the diffusion process. It dynamically estimates the presence of target concepts in a prompt and performs a calibrated vector subtraction to neutralize their influence at the source, enhancing both erasure completeness and locality. The framework includes a Co-Occurrence Encoding module for robust multi-concept erasure and a visual feedback loop to address latent concept persistence. As a training-free method, Semantic Surgery adapts dynamically to each prompt, ensuring precise interventions. Extensive experiments on object, explicit content, artistic style, and multi-celebrity erasure tasks show…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
