Delete and Retain: Efficient Unlearning for Document Classification

Aadya Goel; Mayuri Sridhar

arXiv:2512.13711·cs.LG·December 17, 2025

Delete and Retain: Efficient Unlearning for Document Classification

Aadya Goel, Mayuri Sridhar

PDF

Open Access

TL;DR

This paper introduces Hessian Reassignment, a fast, model-agnostic method for class-level unlearning in document classifiers, achieving near full retrain accuracy and improved privacy guarantees.

Contribution

The paper presents a novel two-step unlearning approach that efficiently removes class influence from document classifiers, with a decision-space guarantee and improved privacy.

Findings

01

Achieves accuracy close to full retraining after unlearning

02

Runs orders of magnitude faster than full retraining

03

Reduces membership-inference advantage on removed class

Abstract

Machine unlearning aims to efficiently remove the influence of specific training data from a model without full retraining. While much progress has been made in unlearning for LLMs, document classification models remain relatively understudied. In this paper, we study class-level unlearning for document classifiers and present Hessian Reassignment, a two-step, model-agnostic solution. First, we perform a single influence-style update that subtracts the contribution of all training points from the target class by solving a Hessian-vector system with conjugate gradients, requiring only gradient and Hessian-vector products. Second, in contrast to common unlearning baselines that randomly reclassify deleted-class samples, we enforce a decision-space guarantee via Top-1 classification. On standard text benchmarks, Hessian Reassignment achieves retained-class accuracy close to full…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHandwritten Text Recognition Techniques · Adversarial Robustness in Machine Learning · Machine Learning and Data Classification