EGLOCE: Training-Free Energy-Guided Latent Optimization for Concept Erasure
Junyeong Ahn, Seojin Yoon, Sungyong Baik

TL;DR
EGLOCE is a novel, training-free method that uses energy-guided latent optimization during inference to effectively erase specific concepts from diffusion model outputs, enhancing safety and control.
Contribution
It introduces a dual-energy framework for concept erasure that operates entirely at inference, avoiding costly re-training and preserving image quality.
Findings
EGLOCE outperforms existing methods in concept removal effectiveness.
It maintains high image quality and prompt alignment after erasure.
The approach is robust against adversarial attacks.
Abstract
As text-to-image diffusion models grow increasingly prevalent, the ability to remove specific concepts-mostly explicit content and many copyrighted characters or styles-has become essential for safety and compliance. Existing unlearning approaches often require costly re-training, modify parameters at the cost of degradation of unrelated concept fidelity, or depend on indirect inference-time adjustment that compromise the effectiveness of concept erasure. Inspired by the success of energy-guided sampling for preservation of the condition of diffusion models, we introduce Energy-Guided Latent Optimization for Concept Erasure (EGLOCE), a training-free approach that removes unwanted concepts by re-directing noisy latent during inference. Our method employs a dual-objective framework: a repulsion energy that steers generation away from target concepts via gradient descent in latent space,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
