MPRU: Modular Projection-Redistribution Unlearning as Output Filter for Classification Pipelines
Minyi Peng, Darian Gunamardi, Ivan Tjuawinata, Kwok-Yan Lam

TL;DR
This paper introduces MPRU, a modular unlearning method that efficiently removes class information from classifiers by reversing training sequences, without needing full data access, thus enhancing scalability and practicality.
Contribution
The paper proposes a novel, model-agnostic unlearning approach using a projection-redistribution layer that simplifies removal of class knowledge without retraining from scratch.
Findings
Achieves similar output to full retraining with less computation
Works across image and tabular datasets with different models
Maintains performance while improving scalability
Abstract
As a new and promising approach, existing machine unlearning (MU) works typically emphasize theoretical formulations or optimization objectives to achieve knowledge removal. However, when deployed in real-world scenarios, such solutions typically face scalability issues and have to address practical requirements such as full access to original datasets and model. In contrast to the existing approaches, we regard classification training as a sequential process where classes are learned sequentially, which we call \emph{inductive approach}. Unlearning can then be done by reversing the last training sequence. This is implemented by appending a projection-redistribution layer in the end of the model. Such an approach does not require full access to the original dataset or the model, addressing the challenges of existing methods. This enables modular and model-agnostic deployment as an…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
