Statistical Roughness-Informed Machine Unlearning
Mohammad Partohaghighi, Roummel Marcia, Bruce J. West, YangQuan Chen

TL;DR
This paper introduces SRAGU, a novel layer-wise spectral reweighting method for machine unlearning that enhances stability and effectiveness in removing data influence from deep neural networks, especially under large or adversarial deletions.
Contribution
The paper proposes SRAGU, a new unlearning algorithm that adaptively reweights layer updates based on spectral diagnostics, improving stability and accuracy in unlearning tasks.
Findings
SRAGU improves stability in unlearning large or adversarial data deletions.
Spectral reweighting concentrates updates in stable layers, reducing catastrophic forgetting.
Empirical results show better alignment with retrained models and reduced leakage signals.
Abstract
Machine unlearning aims to remove the influence of a designated forget set from a trained model while preserving utility on the retained data. In modern deep networks, approximate unlearning frequently fails under large or adversarial deletions due to pronounced layer-wise heterogeneity: some layers exhibit stable, well-regularized representations while others are brittle, undertrained, or overfit, so naive update allocation can trigger catastrophic forgetting or unstable dynamics. We propose Statistical-Roughness Adaptive Gradient Unlearning (SRAGU), a mechanism-first unlearning algorithm that reallocates unlearning updates using layer-wise statistical roughness operationalized via heavy-tailed spectral diagnostics of layer weight matrices. Starting from an Adaptive Gradient Unlearning (AGU) sensitivity signal computed on the forget set, SRAGU estimates a WeightWatcher-style…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Domain Adaptation and Few-Shot Learning · Generative Adversarial Networks and Image Synthesis
