LUME: LLM Unlearning with Multitask Evaluations

Anil Ramakrishna; Yixin Wan; Xiaomeng Jin; Kai-Wei Chang; Zhiqi Bu; Bhanukiran Vinzamuri; Volkan Cevher; Mingyi Hong; Rahul Gupta

arXiv:2502.15097·cs.CL·February 28, 2025

TL;DR

This paper introduces LUME, a multi-task benchmark for evaluating unlearning methods in large language models, focusing on removing sensitive, copyrighted, or private content without full retraining.

Contribution

It presents a new multi-task benchmark (LUME), releases fine-tuned LLMs, and evaluates recent unlearning algorithms with detailed metrics and analysis.

Findings

01 Unlearning algorithms show varied effectiveness across tasks.

02 Some methods struggle with sensitive and copyrighted content.

03 Benchmark reveals limitations of current unlearning techniques.

Abstract

Unlearning aims to remove copyrighted, sensitive, or private content from large language models (LLMs) without a full retraining. In this work, we develop a multi-task unlearning benchmark (LUME) which features three tasks: (1) unlearn synthetically generated creative short novels, (2) unlearn synthetic biographies with sensitive information, and (3) unlearn a collection of public biographies. We further release two fine-tuned LLMs of 1B and 7B parameter sizes as the target models. We conduct detailed evaluations of several recently proposed unlearning algorithms and present results on carefully crafted metrics to understand their behavior and limitations.

Code & Models

Repositories

amazon-science/lume-llm-unlearning

Videos

LUME: LLM Unlearning with Multitask Evaluations · Underline

Taxonomy

Topics: Domain Adaptation and Few-Shot Learning · Topic Modeling · Authorship Attribution and Profiling