The Unseen Threat: Residual Knowledge in Machine Unlearning under Perturbed Samples

Hsiang Hsu; Pradeep Niroula; Zichang He; Ivan Brugere; Freddy Lecue; Chun-Fu Chen

arXiv:2601.22359·cs.LG·February 2, 2026

The Unseen Threat: Residual Knowledge in Machine Unlearning under Perturbed Samples

Hsiang Hsu, Pradeep Niroula, Zichang He, Ivan Brugere, Freddy Lecue, Chun-Fu Chen

PDF

Open Access

TL;DR

This paper identifies a privacy risk in machine unlearning where residual knowledge persists around forgotten samples, especially under adversarial perturbations, and proposes a fine-tuning method to mitigate this issue.

Contribution

The paper formalizes the residual knowledge vulnerability in machine unlearning and introduces RURK, a fine-tuning strategy to reduce residual knowledge in high-dimensional models.

Findings

01

Residual knowledge is prevalent across existing unlearning methods.

02

The proposed RURK method effectively reduces residual knowledge.

03

Residual knowledge persists even when models are unlearned, especially under adversarial perturbations.

Abstract

Machine unlearning offers a practical alternative to avoid full model re-training by approximately removing the influence of specific user data. While existing methods certify unlearning via statistical indistinguishability from re-trained models, these guarantees do not naturally extend to model outputs when inputs are adversarially perturbed. In particular, slight perturbations of forget samples may still be correctly recognized by the unlearned model - even when a re-trained model fails to do so - revealing a novel privacy risk: information about the forget samples may persist in their local neighborhood. In this work, we formalize this vulnerability as residual knowledge and show that it is inevitable in high-dimensional settings. To mitigate this risk, we propose a fine-tuning strategy, named RURK, that penalizes the model's ability to re-recognize perturbed forget samples.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Domain Adaptation and Few-Shot Learning · Privacy-Preserving Technologies in Data