OpenUnlearning: Accelerating LLM Unlearning via Unified Benchmarking of Methods and Metrics

Vineeth Dorna; Anmol Mekala; Wenlong Zhao; Andrew McCallum; Zachary C. Lipton; J. Zico Kolter; Pratyush Maini

arXiv:2506.12618·cs.CL·November 11, 2025

OpenUnlearning: Accelerating LLM Unlearning via Unified Benchmarking of Methods and Metrics

Vineeth Dorna, Anmol Mekala, Wenlong Zhao, Andrew McCallum, Zachary C. Lipton, J. Zico Kolter, Pratyush Maini

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces OpenUnlearning, a comprehensive framework for benchmarking LLM unlearning methods and metrics, aiming to standardize evaluation, improve reproducibility, and accelerate progress in safe model deployment.

Contribution

It presents a unified, extensible benchmarking platform that integrates multiple algorithms, evaluation metrics, and datasets for systematic analysis of LLM unlearning techniques.

Findings

01

OpenUnlearning includes 13 algorithms and 16 evaluation metrics.

02

Benchmarking reveals significant variability in unlearning effectiveness.

03

The framework facilitates analysis of forgetting behaviors across 450+ checkpoints.

Abstract

Robust unlearning is crucial for safely deploying large language models (LLMs) in environments where data privacy, model safety, and regulatory compliance must be ensured. Yet the task is inherently challenging, partly due to difficulties in reliably measuring whether unlearning has truly occurred. Moreover, fragmentation in current methodologies and inconsistent evaluation metrics hinder comparative analysis and reproducibility. To unify and accelerate research efforts, we introduce OpenUnlearning, a standardized and extensible framework designed explicitly for benchmarking both LLM unlearning methods and metrics. OpenUnlearning integrates 13 unlearning algorithms and 16 diverse evaluations across 3 leading benchmarks (TOFU, MUSE, and WMDP) and also enables analyses of forgetting behaviors across 450+ checkpoints we publicly release. Leveraging OpenUnlearning, we propose a novel…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

locuslab/open-unlearning
pytorchOfficial

Videos

OpenUnlearning: Accelerating LLM Unlearning via Unified Benchmarking of Methods and Metrics· slideslive

Taxonomy

TopicsMathematics, Computing, and Information Processing · Natural Language Processing Techniques

MethodsFragmentation