LLM Unlearning Without an Expert Curated Dataset
Xiaoyuan Zhu, Muru Zhang, Ollie Liu, Robin Jia, Willie Neiswanger

TL;DR
This paper presents a scalable method for generating synthetic forget sets using language models to enable effective post-hoc unlearning of specific knowledge in large language models, without requiring expert-curated datasets.
Contribution
The authors introduce an automated, multi-step prompting pipeline to generate high-quality synthetic datasets for unlearning, outperforming baseline synthetic data and matching expert-curated datasets.
Findings
Synthetic datasets outperform baseline alternatives
Multi-step generation improves data diversity
Synthetic data matches expert-curated quality
Abstract
Modern large language models often encode sensitive, harmful, or copyrighted knowledge, raising the need for post-hoc unlearning-the ability to remove specific domains of knowledge from a model without full retraining. A major bottleneck in current unlearning pipelines is constructing effective forget sets-datasets that approximate the target domain and guide the model to forget it. In this work, we introduce a scalable, automated approach to generate high-quality forget sets using language models themselves. Our method synthesizes textbook-style data through a structured prompting pipeline, requiring only a domain name as input. Through experiments on unlearning biosecurity, cybersecurity, and Harry Potter novels, we show that our synthetic datasets consistently outperform the baseline synthetic alternatives and are comparable to the expert-curated ones. Additionally, ablation studies…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
- 🤗WhyTheMoon/Llama-3-8B-Instruct_RMU_Keyword-Biomodel· 1 dl1 dl
- 🤗WhyTheMoon/Llama-3-8B-Instruct_RMU_Textbook-Biomodel· 1 dl1 dl
- 🤗WhyTheMoon/Llama-3-8B-Instruct_RMU_Filter-Biomodel· 1 dl1 dl
- 🤗WhyTheMoon/Llama-3-8B-Instruct_RMU_Textbook-Cybermodel· 1 dl1 dl
- 🤗WhyTheMoon/Llama-3-8B-Instruct_RMU_Filter-Cybermodel
- 🤗WhyTheMoon/Llama-3-8B-Instruct_RMU_Keyword-Cybermodel
- 🤗WhyTheMoon/Llama-3-8B-Instruct_RMU_Textbook-HPmodel· 2 dl2 dl
- 🤗WhyTheMoon/Llama-3-8B-Instruct_RMU_Textbook-HP-Simplestmodel
- 🤗WhyTheMoon/Mistral-7B-Instruct-v0.3_RMU_Textbook-Biomodel· 1 dl1 dl
- 🤗WhyTheMoon/Mistral-7B-Instruct-v0.3_RMU_Filter-Biomodel
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Adversarial Robustness in Machine Learning · Artificial Intelligence in Healthcare and Education
