Recover-to-Forget: Gradient Reconstruction from LoRA for Efficient LLM Unlearning
Yezi Liu, Hanning Chen, Wenjun Huang, Yang Ni, Mohsen Imani

TL;DR
Recover-to-Forget (R2F) introduces a scalable, efficient unlearning method for large language models by reconstructing full-model gradients from low-rank LoRA updates, avoiding full retraining.
Contribution
The paper proposes R2F, a novel framework that enables efficient unlearning in LLMs through gradient reconstruction from LoRA, applicable to black-box models and trained on a proxy model.
Findings
R2F effectively unlearns data while maintaining model performance.
The method reduces computational costs compared to full-model fine-tuning.
Experimental results validate the scalability and practicality of R2F.
Abstract
Unlearning in large foundation models (e.g., LLMs) is essential for enabling dynamic knowledge updates, enforcing data deletion rights, and correcting model behavior. However, existing unlearning methods often require full-model fine-tuning or access to the original training data, which limits their scalability and practicality. In this work, we introduce Recover-to-Forget (R2F), a novel framework for efficient unlearning in LLMs based on reconstructing full-model gradient directions from low-rank LoRA adapter updates. Rather than performing backpropagation through the full model, we compute gradients with respect to LoRA parameters using multiple paraphrased prompts and train a gradient decoder to approximate the corresponding full-model gradients. To ensure applicability to larger or black-box models, the decoder is trained on a proxy model and transferred to target models. We provide…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Generative Adversarial Networks and Image Synthesis · Advanced Graph Neural Networks
