Towards Natural Machine Unlearning
Zhengbao He, Tao Li, Xinwen Cheng, Zhehao Huang, Xiaolin Huang

TL;DR
This paper proposes a natural machine unlearning method that injects correct information into forgetting samples, reducing over-forgetting and improving robustness compared to existing approaches.
Contribution
It introduces a novel approach to machine unlearning by injecting correct information into forgetting samples, enhancing naturalness and performance.
Findings
Significantly reduces over-forgetting.
Outperforms state-of-the-art methods.
Provides robustness to hyperparameters.
Abstract
Machine unlearning (MU) aims to eliminate information that has been learned from specific training data, namely forgetting data, from a pre-trained model. Currently, the mainstream of existing MU methods involves modifying the forgetting data with incorrect labels and subsequently fine-tuning the model. While learning such incorrect information can indeed remove knowledge, the process is quite unnatural as the unlearning process undesirably reinforces the incorrect information and leads to over-forgetting. Towards more \textit{natural} machine unlearning, we inject correct information from the remaining data to the forgetting samples when changing their labels. Through pairing these adjusted samples with their labels, the model will tend to use the injected correct information and naturally suppress the information meant to be forgotten. Albeit straightforward, such a first step towards…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Data Processing Techniques · Neural Networks and Applications
