LLM Unlearning Without an Expert Curated Dataset

Xiaoyuan Zhu; Muru Zhang; Ollie Liu; Robin Jia; Willie Neiswanger

arXiv:2508.06595·cs.CL·October 8, 2025

LLM Unlearning Without an Expert Curated Dataset

Xiaoyuan Zhu, Muru Zhang, Ollie Liu, Robin Jia, Willie Neiswanger

PDF

Open Access 10 Models 5 Datasets

TL;DR

This paper presents a scalable method for generating synthetic forget sets using language models to enable effective post-hoc unlearning of specific knowledge in large language models, without requiring expert-curated datasets.

Contribution

The authors introduce an automated, multi-step prompting pipeline to generate high-quality synthetic datasets for unlearning, outperforming baseline synthetic data and matching expert-curated datasets.

Findings

01

Synthetic datasets outperform baseline alternatives

02

Multi-step generation improves data diversity

03

Synthetic data matches expert-curated quality

Abstract

Modern large language models often encode sensitive, harmful, or copyrighted knowledge, raising the need for post-hoc unlearning-the ability to remove specific domains of knowledge from a model without full retraining. A major bottleneck in current unlearning pipelines is constructing effective forget sets-datasets that approximate the target domain and guide the model to forget it. In this work, we introduce a scalable, automated approach to generate high-quality forget sets using language models themselves. Our method synthesizes textbook-style data through a structured prompting pipeline, requiring only a domain name as input. Through experiments on unlearning biosecurity, cybersecurity, and Harry Potter novels, we show that our synthetic datasets consistently outperform the baseline synthetic alternatives and are comparable to the expert-curated ones. Additionally, ablation studies…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Models

Datasets

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Adversarial Robustness in Machine Learning · Artificial Intelligence in Healthcare and Education