Roots Beneath the Cut: Uncovering the Risk of Concept Revival in Pruning-Based Unlearning for Diffusion Models

Ci Zhang; Zhaojun Ding; Chence Yang; Jun Liu; Xiaoming Zhai; Shaoyi Huang; Beiwen Li; Xiaolong Ma; Jin Lu; Geng Yuan

arXiv:2603.06640·cs.CV·March 10, 2026

Roots Beneath the Cut: Uncovering the Risk of Concept Revival in Pruning-Based Unlearning for Diffusion Models

Ci Zhang, Zhaojun Ding, Chence Yang, Jun Liu, Xiaoming Zhai, Shaoyi Huang, Beiwen Li, Xiaolong Ma, Jin Lu, Geng Yuan

PDF

Open Access

TL;DR

This paper reveals that pruning-based unlearning in diffusion models can leak concept information through pruned weights, allowing concept revival attacks, and proposes defenses for safer unlearning.

Contribution

It uncovers a security vulnerability in pruning-based unlearning and introduces a novel attack framework to revive erased concepts without data or retraining.

Findings

01

Pruned weights can leak concept information.

02

Erased concepts can be revived without data or retraining.

03

Safer pruning strategies can mitigate information leakage.

Abstract

Pruning-based unlearning has recently emerged as a fast, training-free, and data-independent approach to remove undesired concepts from diffusion models. It promises high efficiency and robustness, offering an attractive alternative to traditional fine-tuning or editing-based unlearning. However, in this paper we uncover a hidden danger behind this promising paradigm. We find that the locations of pruned weights, typically set to zero during unlearning, can act as side-channel signals that leak critical information about the erased concepts. To verify this vulnerability, we design a novel attack framework capable of reviving erased concepts from pruned diffusion models in a fully data-free and training-free manner. Our experiments confirm that pruning-based unlearning is not inherently secure, as erased concepts can be effectively revived without any additional data or retraining.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Advanced Graph Neural Networks · Advanced Malware Detection Techniques