Unified Parameter-Efficient Unlearning for LLMs
Chenlu Ding, Jiancan Wu, Yancheng Yuan, Jinda Lu, Kai Zhang, Alex Su,, Xiang Wang, Xiangnan He

TL;DR
This paper introduces LLMEraser, a novel framework for efficient, instance-wise unlearning in large language models that addresses privacy concerns without extensive retraining.
Contribution
The paper presents LLMEraser, a new unlearning method that uses influence functions for precise parameter adjustments, applicable across diverse unlearning tasks in LLMs.
Findings
LLMEraser effectively handles various unlearning scenarios.
It maintains model performance after unlearning.
The approach is efficient and scalable.
Abstract
The advent of Large Language Models (LLMs) has revolutionized natural language processing, enabling advanced understanding and reasoning capabilities across a variety of tasks. Fine-tuning these models for specific domains, particularly through Parameter-Efficient Fine-Tuning (PEFT) strategies like LoRA, has become a prevalent practice due to its efficiency. However, this raises significant privacy and security concerns, as models may inadvertently retain and disseminate sensitive or undesirable information. To address these issues, we introduce a novel instance-wise unlearning framework, LLMEraser, which systematically categorizes unlearning tasks and applies precise parameter adjustments using influence functions. Unlike traditional unlearning techniques that are often limited in scope and require extensive retraining, LLMEraser is designed to handle a broad spectrum of unlearning…
Peer Reviews
Decision·ICLR 2025 Poster
The paper unifies three unlearning objective under a single simple framework. The paper directly addresses the complexity concerns the framework seems to have in two simple ways: using fast Hessian-vector product, and using mini-batches. It then provides a convincing argument for why such problem formulation would converge using SGD.
The paper is a bit hard to read. I would suggest putting most equations in the appendix and focus on high-level explanation in the body of the paper. It is unclear exactly what exactly how adapters are used in Algorithm 1. The evaluation is a bit limited, consider comparing this method performance to other unlearning approaches. Evaluating efficiency in seconds can be a nice addition but the main evaluation should compare an asymptotic running-time complexity.
1. The problem is important and interesting. The three unlearning tasks are practical and meaningful in real-world settings. 2. The unlearning approach edits model parameters based on influence functions, which avoids finetuning or re-training the models 3. The evaluations are comprehensive. 4. The writing is clear and easy to follow.
My concern is mainly about evaluation datasets. 1. Although the paper claims to address privacy concerns in LLMs, the authors primarily evaluated the method using tabular datasets for recommendation tasks and a multimodal dataset for a classification task. While these tasks are representative, they are relatively simple and straightforward. Although the authors applied prompt engineering to those data, the training data are still very similar. 2. In real AI applications, LLMs are used for te
1) The paper shows the motivation and derivation of the proposed algorithm/framework. 2) The proposed algorithm demonstrates that it can effectively maintain good performance on different tasks, including recommendation and relation mining tasks. 3) The proposed algorithm is also shown to have high efficiency compared to the existing algorithms.
1) All the experiments show how well the models can perform after unlearning a certain amount of training samples. However, it seems the paper does not present or compare how effectively the algorithms can let the model "forget" those training samples, which is the original goal of the unlearning algorithm. 2) As PEFT is fine-tuning the model, it is unclear how to distinguish the influence of a sample when it is in the fine-tuning training set or the pre-training training set. When the sample wa
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Control Systems Optimization · Advanced Data Compression Techniques
