MeGU: Machine-Guided Unlearning with Target Feature Disentanglement
Haoyu Wang, Zhuo Huang, Xiaolong Wang, Bo Han, Zhiwei Lin, Tongliang Liu

TL;DR
MeGU introduces a novel framework for machine unlearning that leverages concept-aware re-alignment and feature disentanglement to effectively erase target data influence while maintaining model utility.
Contribution
The paper proposes MeGU, a new unlearning method using large language models for semantic re-alignment and feature noise to improve selective forgetting.
Findings
Effective target data erasure with minimal utility loss
Utilizes large language models for semantic guidance
Disentangles features to improve unlearning precision
Abstract
The growing concern over training data privacy has elevated the "Right to be Forgotten" into a critical requirement, thereby raising the demand for effective Machine Unlearning. However, existing unlearning approaches commonly suffer from a fundamental trade-off: aggressively erasing the influence of target data often degrades model utility on retained data, while conservative strategies leave residual target information intact. In this work, the intrinsic representation properties learned during model pretraining are analyzed. It is demonstrated that semantic class concepts are entangled at the feature-pattern level, sharing associated features while preserving concept-specific discriminative components. This entanglement fundamentally limits the effectiveness of existing unlearning paradigms. Motivated by this insight, we propose Machine-Guided Unlearning (MeGU), a novel framework…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Machine Learning and Data Classification · Explainable Artificial Intelligence (XAI)
