Retain-Neutral Surrogates for Min-Max Unlearning
Junhao Cai, Dohun Kim, Dowon Kim, Sung Il Choi, Chengjun Jin, Juhyun Park, Changhee Joo

TL;DR
This paper introduces ROSU, a novel method for machine unlearning that constrains surrogate construction to improve retain-neutrality, especially effective in high-coupling regimes.
Contribution
ROSU is a new unlearning approach that maximizes forget gain while maintaining retain neutrality through a closed-form perturbation, with theoretical and empirical advantages.
Findings
ROSU provides a curvature-controlled second-order bound on retain damage.
ROSU reduces surrogate retain loss in high-coupling regimes.
Empirical results show ROSU's effectiveness across vision and language benchmarks.
Abstract
Machine unlearning seeks to remove the influence of designated training data while preserving performance on the remaining data. Approximate unlearning can be viewed as a local editing problem; in min-max unlearning, the key local object is the surrogate point at which the retain objective is evaluated. When forget and retain gradients are strongly aligned, an unconstrained forget-maximizing perturbation can move to a surrogate point that increases retain loss. We propose Retain-Orthogonal Surrogate Unlearning (ROSU), which constrains the inner surrogate construction by maximizing first-order forget gain subject to zero first-order retain change under a fixed perturbation budget. This yields a closed-form retain-orthogonal perturbation, a lightweight transported outer update, and amplification along the retain-neutral direction. Our analysis establishes (i) a curvature-controlled…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
