Targeted Unlearning Using Perturbed Sign Gradient Methods With Applications On Medical Images

George R. Nahass; Zhu Wang; Homa Rashidisabet; Won Hwa Kim; Sasha Hubschman; Jeffrey C. Peterson; Chad A. Purnell; Pete Setabutr; Ann Q. Tran; Darvin Yi; Sathya N. Ravi

arXiv:2505.21872·eess.IV·February 11, 2026

Targeted Unlearning Using Perturbed Sign Gradient Methods With Applications On Medical Images

George R. Nahass, Zhu Wang, Homa Rashidisabet, Won Hwa Kim, Sasha Hubschman, Jeffrey C. Peterson, Chad A. Purnell, Pete Setabutr, Ann Q. Tran, Darvin Yi, Sathya N. Ravi

PDF

Open Access

TL;DR

This paper presents a bilevel optimization-based method for machine unlearning in medical imaging, enabling efficient model updates post-deployment with guarantees and improved performance over existing approaches.

Contribution

It introduces a novel boundary-based unlearning framework with convergence guarantees, tunable tradeoffs, and model composition strategies for clinical applications.

Findings

01

Outperforms baselines on forgetting and retention metrics

02

Effective in clinical imaging datasets with device and anatomical variations

03

Supports practical, modular model maintenance in healthcare

Abstract

Machine unlearning aims to remove the influence of specific training samples from a trained model without full retraining. While prior work has largely focused on privacy-motivated settings, we recast unlearning as a general-purpose tool for post-deployment model revision. Specifically, we focus on utilizing unlearning in clinical contexts where data shifts, device deprecation, and policy changes are common. To this end, we propose a bilevel optimization formulation of boundary-based unlearning that can be solved using iterative algorithms. We provide convergence guarantees when first-order algorithms are used to unlearn. Our method introduces tunable loss design for controlling the forgetting-retention tradeoff and supports novel model composition strategies that merge the strengths of distinct unlearning runs. Across benchmark and real-world clinical imaging datasets, our approach…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Adversarial Robustness in Machine Learning

MethodsFocus