LUME: LLM Unlearning with Multitask Evaluations
Anil Ramakrishna, Yixin Wan, Xiaomeng Jin, Kai-Wei Chang, Zhiqi Bu,, Bhanukiran Vinzamuri, Volkan Cevher, Mingyi Hong, Rahul Gupta

TL;DR
This paper introduces LUME, a multi-task benchmark for evaluating unlearning methods in large language models, focusing on removing sensitive, copyrighted, or private content without full retraining.
Contribution
It presents a new multi-task benchmark (LUME), releases fine-tuned LLMs, and evaluates recent unlearning algorithms with detailed metrics and analysis.
Findings
Unlearning algorithms show varied effectiveness across tasks.
Some methods struggle with sensitive and copyrighted content.
Benchmark reveals limitations of current unlearning techniques.
Abstract
Unlearning aims to remove copyrighted, sensitive, or private content from large language models (LLMs) without a full retraining. In this work, we develop a multi-task unlearning benchmark (LUME) which features three tasks: (1) unlearn synthetically generated creative short novels, (2) unlearn synthetic biographies with sensitive information, and (3) unlearn a collection of public biographies. We further release two fine-tuned LLMs of 1B and 7B parameter sizes as the target models. We conduct detailed evaluations of several recently proposed unlearning algorithms and present results on carefully crafted metrics to understand their behavior and limitations.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Topic Modeling · Authorship Attribution and Profiling
