Unlearn What You Want to Forget: Efficient Unlearning for LLMs

Jiaao Chen; Diyi Yang

arXiv:2310.20150·cs.CL·November 1, 2023·1 cites

Unlearn What You Want to Forget: Efficient Unlearning for LLMs

Jiaao Chen, Diyi Yang

PDF

Open Access 1 Repo

TL;DR

This paper presents a novel, efficient method for unlearning specific data from large language models by introducing lightweight unlearning layers and a fusion mechanism, enabling data removal without full retraining.

Contribution

The authors propose a lightweight unlearning framework with a fusion mechanism for sequential data removal in LLMs, improving efficiency and flexibility over existing methods.

Findings

01

Effective data removal demonstrated on classification tasks

02

Maintains model performance after unlearning

03

Outperforms state-of-the-art baselines

Abstract

Large language models (LLMs) have achieved significant progress from pre-training on and memorizing a wide range of textual data, however, this process might suffer from privacy issues and violations of data protection regulations. As a result, the ability to easily remove data related to individual users from such models while not deteriorating their predictive quality after the removal becomes increasingly important. To address these issues, in this work, we propose an efficient unlearning framework that could efficiently update LLMs without having to retrain the whole model after data removals, by introducing lightweight unlearning layers learned with a selective teacher-student objective into the transformers. In addition, we introduce a fusion mechanism to effectively combine different unlearning layers that learns to forget different sets of data to handle a sequence of forgetting…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

SALT-NLP/Efficient_Unlearning
jaxOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsPrivacy-Preserving Technologies in Data · Topic Modeling · Natural Language Processing Techniques