Improved Localized Machine Unlearning Through the Lens of Memorization
Reihaneh Torkzadehmahani, Reza Nasirigerdeh, Georgios Kaissis, Daniel, Rueckert, Gintare Karolina Dziugaite, Eleni Triantafillou

TL;DR
This paper introduces a novel localized unlearning method called DEL, which effectively removes data influence with minimal parameter modification, improving unlearning performance and test accuracy.
Contribution
It proposes an improved localization strategy inspired by memorization literature and a new unlearning algorithm, DEL, that outperforms existing methods in efficiency and accuracy.
Findings
DEL achieves state-of-the-art unlearning metrics.
DEL modifies fewer parameters while maintaining high performance.
DEL outperforms existing localized unlearning methods in accuracy.
Abstract
Machine unlearning refers to removing the influence of a specified subset of training data from a machine learning model, efficiently, after it has already been trained. This is important for key applications, including making the model more accurate by removing outdated, mislabeled, or poisoned data. In this work, we study localized unlearning, where the unlearning algorithm operates on a (small) identified subset of parameters. Drawing inspiration from the memorization literature, we propose an improved localization strategy that yields strong results when paired with existing unlearning algorithms. We also propose a new unlearning algorithm, Deletion by Example Localization (DEL), that resets the parameters deemed-to-be most critical according to our localization strategy, and then finetunes them. Our extensive experiments on different datasets, forget sets and metrics reveal that…
Peer Reviews
Decision·Submitted to ICLR 2025
- I liked the initial idea of investigating localized unlearning based on memorization. - The proposed method was partially successful on some forgetting benchmarks.
- The method is based on a lot of assumptions without much justification, but with intuition. Thus, it is very hard to see if the proposed method is indeed ok in terms of unlearning (while preserving the rest!). - It is very hard to see the core contribution clearly due to poor writing. It was very hard to read and follow. - Experiments look quite limited in terms of benchmarks (datasets, compared methods). I am afraid that the localized unlearning approach may hurt the preservation of remainin
This method achieves state-of-the-art unlearning performance while requiring a small modification on a subset of model parameters. This method also minimized unnecessary parameter while preserving the model efficiency.
The weakness of this method is limited experiments on the public dataset, only applied on CIFAR-10 and SVHN datasets, as well as the limitation on larger models.
1. The localized unlearning is new and meaningful research area and the motivation to leverage the memorization is reasonable and insightful. 2. The experiments and findings are validated with various metrics and existing unlearning algorithms and show consistently good results. 3. The paper is well formatted and arranged so that easy to understand.
1. There are several mathematical definitions such as the Unlearning and Label memorization. However, I did not find close connections or logical relations between them. If necessary, I expect the author to use these definitions to derive some theorems closely based on the proposed algorithm. For example, it is difficult to see theoretically or empirically if the proposed algorithm can make distribution the same as the model trained without that data. 2. Following the above, I understand in thi
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExperimental Learning in Engineering · Educational Technology and Assessment
