Unified Parameter-Efficient Unlearning for LLMs

Chenlu Ding; Jiancan Wu; Yancheng Yuan; Jinda Lu; Kai Zhang; Alex Su,; Xiang Wang; Xiangnan He

arXiv:2412.00383·cs.AI·April 21, 2025

Unified Parameter-Efficient Unlearning for LLMs

Chenlu Ding, Jiancan Wu, Yancheng Yuan, Jinda Lu, Kai Zhang, Alex Su,, Xiang Wang, Xiangnan He

PDF

Open Access 1 Repo 3 Reviews

TL;DR

This paper introduces LLMEraser, a novel framework for efficient, instance-wise unlearning in large language models that addresses privacy concerns without extensive retraining.

Contribution

The paper presents LLMEraser, a new unlearning method that uses influence functions for precise parameter adjustments, applicable across diverse unlearning tasks in LLMs.

Findings

01

LLMEraser effectively handles various unlearning scenarios.

02

It maintains model performance after unlearning.

03

The approach is efficient and scalable.

Abstract

The advent of Large Language Models (LLMs) has revolutionized natural language processing, enabling advanced understanding and reasoning capabilities across a variety of tasks. Fine-tuning these models for specific domains, particularly through Parameter-Efficient Fine-Tuning (PEFT) strategies like LoRA, has become a prevalent practice due to its efficiency. However, this raises significant privacy and security concerns, as models may inadvertently retain and disseminate sensitive or undesirable information. To address these issues, we introduce a novel instance-wise unlearning framework, LLMEraser, which systematically categorizes unlearning tasks and applies precise parameter adjustments using influence functions. Unlike traditional unlearning techniques that are often limited in scope and require extensive retraining, LLMEraser is designed to handle a broad spectrum of unlearning…

Peer Reviews

Decision·ICLR 2025 Poster

Reviewer 01Rating 8Confidence 4

Strengths

The paper unifies three unlearning objective under a single simple framework. The paper directly addresses the complexity concerns the framework seems to have in two simple ways: using fast Hessian-vector product, and using mini-batches. It then provides a convincing argument for why such problem formulation would converge using SGD.

Weaknesses

The paper is a bit hard to read. I would suggest putting most equations in the appendix and focus on high-level explanation in the body of the paper. It is unclear exactly what exactly how adapters are used in Algorithm 1. The evaluation is a bit limited, consider comparing this method performance to other unlearning approaches. Evaluating efficiency in seconds can be a nice addition but the main evaluation should compare an asymptotic running-time complexity.

Reviewer 02Rating 8Confidence 3

Strengths

1. The problem is important and interesting. The three unlearning tasks are practical and meaningful in real-world settings. 2. The unlearning approach edits model parameters based on influence functions, which avoids finetuning or re-training the models 3. The evaluations are comprehensive. 4. The writing is clear and easy to follow.

Weaknesses

My concern is mainly about evaluation datasets. 1. Although the paper claims to address privacy concerns in LLMs, the authors primarily evaluated the method using tabular datasets for recommendation tasks and a multimodal dataset for a classification task. While these tasks are representative, they are relatively simple and straightforward. Although the authors applied prompt engineering to those data, the training data are still very similar. 2. In real AI applications, LLMs are used for te

Reviewer 03Rating 6Confidence 3

Strengths

1) The paper shows the motivation and derivation of the proposed algorithm/framework. 2) The proposed algorithm demonstrates that it can effectively maintain good performance on different tasks, including recommendation and relation mining tasks. 3) The proposed algorithm is also shown to have high efficiency compared to the existing algorithms.

Weaknesses

1) All the experiments show how well the models can perform after unlearning a certain amount of training samples. However, it seems the paper does not present or compare how effectively the algorithms can let the model "forget" those training samples, which is the original goal of the unlearning algorithm. 2) As PEFT is fine-tuning the model, it is unclear how to distinguish the influence of a sample when it is in the fine-tuning training set or the pre-training training set. When the sample wa

Code & Models

Repositories

oceanoceanna/LLMEraser
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Control Systems Optimization · Advanced Data Compression Techniques