Don't Forget Too Much: Towards Machine Unlearning on Feature Level
Heng Xu, Tianqing Zhu, Wanlei Zhou, Wei Zhao

TL;DR
This paper introduces a novel feature-level machine unlearning method that enables models to selectively forget specific features within instances, improving granularity and utility over traditional instance-based unlearning techniques.
Contribution
The paper proposes a new feature unlearning scheme with approaches for both annotated and unannotated scenarios, utilizing adversarial learning and interpretability techniques.
Findings
Effective removal of feature effects demonstrated across multiple models.
Improved unlearning granularity without significant utility loss.
Versatile approach applicable to various datasets and scenarios.
Abstract
Machine unlearning enables pre-trained models to remove the effect of certain portions of training data. Previous machine unlearning schemes have mainly focused on unlearning a cluster of instances or all instances belonging to a specific class. These types of unlearning might have a significant impact on the model utility; and they may be inadequate for situations where we only need to unlearn features within instances, rather than the whole instances. Due to the different granularity, current unlearning methods can hardly achieve feature-level unlearning. To address the challenges of utility and granularity, we propose a refined granularity unlearning scheme referred to as ``feature unlearning". We first explore two distinct scenarios based on whether the annotation information about the features is given: feature unlearning with known annotations and feature unlearning without…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Data Classification
