SoK: Machine Unlearning for Large Language Models
Jie Ren, Yue Xing, Yingqian Cui, Charu C. Aggarwal, Hui Liu

TL;DR
This paper provides a comprehensive survey of machine unlearning in large language models, proposing a new intention-based taxonomy, analyzing evaluation strategies, and discussing practical challenges for deployment.
Contribution
It introduces an intention-oriented taxonomy for unlearning methods, revisits the distinction between true removal and suppression, and evaluates current evaluation metrics and practical challenges.
Findings
Many removal methods may function as suppression rather than true removal
Current evaluation metrics have limitations in assessing unlearning effectiveness
Scalability and sequential unlearning are significant practical challenges
Abstract
Large language model (LLM) unlearning has become a critical topic in machine learning, aiming to eliminate the influence of specific training data or knowledge without retraining the model from scratch. A variety of techniques have been proposed, including Gradient Ascent, model editing, and re-steering hidden representations. While existing surveys often organize these methods by their technical characteristics, such classifications tend to overlook a more fundamental dimension: the underlying intention of unlearning--whether it seeks to truly remove internal knowledge or merely suppress its behavioral effects. In this SoK paper, we propose a new taxonomy based on this intention-oriented perspective. Building on this taxonomy, we make three key contributions. First, we revisit recent findings suggesting that many removal methods may functionally behave like suppression, and explore…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsArtificial Intelligence in Healthcare and Education · Explainable Artificial Intelligence (XAI) · Ethics and Social Impacts of AI
