Inference-time Unlearning Using Conformal Prediction

Somnath Basu Roy Chowdhury; Rahul Kidambi; Avinava Dubey; David Wang; Gokhan Mergen; Amr Ahmed; Aranyak Mehta

arXiv:2602.03787·cs.LG·February 4, 2026

Inference-time Unlearning Using Conformal Prediction

Somnath Basu Roy Chowdhury, Rahul Kidambi, Avinava Dubey, David Wang, Gokhan Mergen, Amr Ahmed, Aranyak Mehta

PDF

Open Access

TL;DR

This paper introduces an inference-time unlearning framework for generative models that uses conformal prediction and a verifier to efficiently remove specific information without retraining, achieving significant error reduction.

Contribution

It proposes a novel inference-time unlearning method leveraging conformal prediction and a verifier, avoiding parameter updates and improving efficiency and guarantees.

Findings

01

Reduces unlearning error by up to 93%

02

Outperforms existing methods on challenging benchmarks

03

Provides distribution-free unlearning guarantees

Abstract

Machine unlearning is the process of efficiently removing specific information from a trained machine learning model without retraining from scratch. Existing unlearning methods, which often provide provable guarantees, typically involve retraining a subset of model parameters based on a forget set. While these approaches show promise in certain scenarios, their underlying assumptions are often challenged in real-world applications -- particularly when applied to generative models. Furthermore, updating parameters using these unlearning procedures often degrades the general-purpose capabilities the model acquired during pre-training. Motivated by these shortcomings, this paper considers the paradigm of inference time unlearning -- wherein, the generative model is equipped with an (approximately correct) verifier that judges whether the model's response satisfies appropriate unlearning…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Adversarial Robustness in Machine Learning · Machine Learning and Data Classification