Forgetting is Competition: Rethinking Unlearning as Representation Interference in Diffusion Models

Ashutosh Ranjan; Vivek Srivastava; Shirish Karande; Murari Mandal

arXiv:2603.00975·cs.LG·May 19, 2026

Forgetting is Competition: Rethinking Unlearning as Representation Interference in Diffusion Models

Ashutosh Ranjan, Vivek Srivastava, Shirish Karande, Murari Mandal

PDF

TL;DR

This paper introduces SurgUn, a novel method for concept unlearning in diffusion models that uses competition-based interference to better erase targeted concepts while preserving model capabilities.

Contribution

SurgUn applies distractor-conditioned gradient competition and pixel-grounded localization to improve concept unlearning, outperforming existing methods across multiple benchmarks.

Findings

01

SurgUn achieves a stronger erase-retain balance than baselines.

02

Diverse distractors and contrastive competition are essential for effective unlearning.

03

Localization helps limit collateral forgetting while maintaining related concepts.

Abstract

Deployed text-to-image diffusion models increasingly require post-hoc concept unlearning for copyright claims, artist opt-outs, safety updates, and protected-content mitigation without full retraining. A central challenge is erase-retain imbalance, aggressive updates suppress targets but damage shared capabilities, while conservative or anchor-based updates preserve quality yet leave concepts recoverable through related, compositional, paraphrased, or adversarial prompts. Inspired by retroactive interference, we propose SurgUn, which treats forgetting as controlled competition rather than direct deletion or one-to-one reassignment. SurgUn instantiates retroactive concept interference via distractor-conditioned gradient competition: target-gradient ascent weakens target-conditioned denoising or flow-matching behavior, while descent over a semantically diverse distractor set introduces…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Adversarial Robustness in Machine Learning · Cell Image Analysis Techniques