UnlearnShield: Shielding Forgotten Privacy against Unlearning Inversion

Lulu Xue; Shengshan Hu; Wei Lu; Ziqi Zhou; Yufei Song; Jianhong Cheng; Minghui Li; Yanjun Zhang; Leo Yu Zhang

arXiv:2601.20325·cs.CR·January 29, 2026

UnlearnShield: Shielding Forgotten Privacy against Unlearning Inversion

Lulu Xue, Shengshan Hu, Wei Lu, Ziqi Zhou, Yufei Song, Jianhong Cheng, Minghui Li, Yanjun Zhang, Leo Yu Zhang

PDF

Open Access

TL;DR

UnlearnShield is a novel defense mechanism designed to protect privacy by preventing data reconstruction attacks in machine unlearning, balancing privacy, accuracy, and forgetting effectiveness.

Contribution

It introduces directional perturbations and a constraint module to defend against unlearning inversion without sacrificing model utility.

Findings

01

Effective in reducing data reconstruction risk

02

Maintains a good balance between privacy and accuracy

03

Outperforms existing methods in experiments

Abstract

Machine unlearning is an emerging technique that aims to remove the influence of specific data from trained models, thereby enhancing privacy protection. However, recent research has uncovered critical privacy vulnerabilities, showing that adversaries can exploit unlearning inversion to reconstruct data that was intended to be erased. Despite the severity of this threat, dedicated defenses remain lacking. To address this gap, we propose UnlearnShield, the first defense specifically tailored to counter unlearning inversion. UnlearnShield introduces directional perturbations in the cosine representation space and regulates them through a constraint module to jointly preserve model accuracy and forgetting efficacy, thereby reducing inversion risk while maintaining utility. Experiments demonstrate that it achieves a good trade-off among privacy protection, accuracy, and forgetting.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Privacy-Preserving Technologies in Data · Domain Adaptation and Few-Shot Learning