MPU: Towards Secure and Privacy-Preserving Knowledge Unlearning for Large Language Models

Tiantong Wang; Xinyu Yan; Tiantong Wu; Yurong Hao; Pengjun Xie; Wei Yang Bryan Lim

arXiv:2602.23798·cs.LG·May 15, 2026

MPU: Towards Secure and Privacy-Preserving Knowledge Unlearning for Large Language Models

Tiantong Wang, Xinyu Yan, Tiantong Wu, Yurong Hao, Pengjun Xie, Wei Yang Bryan Lim

PDF

1 Repo

TL;DR

This paper introduces MPU, a privacy-preserving framework for unlearning in large language models that uses multiple perturbed copies to enable local unlearning without revealing server parameters.

Contribution

MPU provides a novel, algorithm-agnostic approach for privacy-preserving unlearning by distributing perturbed model copies and aggregating updates, improving privacy without sacrificing performance.

Findings

01

MPU achieves unlearning performance close to noise-free baselines.

02

Most algorithms' performance degradation is below 1% with up to 10% noise.

03

MPU can outperform noise-free baselines for some algorithms under 1% noise.

Abstract

Machine unlearning for large language models often faces a privacy dilemma in which strict constraints prohibit sharing either the server's parameters or the client's forget set. To address this dual non-disclosure constraint, we propose MPU, an algorithm-agnostic privacy-preserving Multiple Perturbed Copies Unlearning framework that primarily introduces two server-side modules: Pre-Process for randomized copy generation and Post-Process for update aggregation. In Pre-Process, the server distributes multiple perturbed and reparameterized model instances, allowing the client to execute unlearning locally on its private forget set without accessing the server's exact original parameters. After local unlearning, the server performs Post-Process by inverting the reparameterization and aggregating updates with a harmonic denoising procedure to alleviate the impact of perturbation.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Tristan0318/MPU
github

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.