Forgetting is Competition: Rethinking Unlearning as Representation Interference in Diffusion Models
Ashutosh Ranjan, Vivek Srivastava, Shirish Karande, Murari Mandal

TL;DR
This paper introduces SurgUn, a novel method for concept unlearning in diffusion models that uses competition-based interference to better erase targeted concepts while preserving model capabilities.
Contribution
SurgUn applies distractor-conditioned gradient competition and pixel-grounded localization to improve concept unlearning, outperforming existing methods across multiple benchmarks.
Findings
SurgUn achieves a stronger erase-retain balance than baselines.
Diverse distractors and contrastive competition are essential for effective unlearning.
Localization helps limit collateral forgetting while maintaining related concepts.
Abstract
Deployed text-to-image diffusion models increasingly require post-hoc concept unlearning for copyright claims, artist opt-outs, safety updates, and protected-content mitigation without full retraining. A central challenge is erase-retain imbalance, aggressive updates suppress targets but damage shared capabilities, while conservative or anchor-based updates preserve quality yet leave concepts recoverable through related, compositional, paraphrased, or adversarial prompts. Inspired by retroactive interference, we propose SurgUn, which treats forgetting as controlled competition rather than direct deletion or one-to-one reassignment. SurgUn instantiates retroactive concept interference via distractor-conditioned gradient competition: target-gradient ascent weakens target-conditioned denoising or flow-matching behavior, while descent over a semantically diverse distractor set introduces…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Adversarial Robustness in Machine Learning · Cell Image Analysis Techniques
