Protecting Privacy Through Approximating Optimal Parameters for Sequence Unlearning in Language Models
Dohyun Lee, Daniel Rim, Minseok Choi, Jaegul Choo

TL;DR
This paper introduces POP, a novel method for unlearning specific data from language models by approximating optimal parameters, effectively balancing privacy protection and model performance.
Contribution
The work proposes a new unlearning technique that approximates optimal gradient updates, improving privacy protection while maintaining model accuracy better than existing methods.
Findings
POP outperforms state-of-the-art unlearning methods across multiple benchmarks.
It effectively forgets target sequences with minimal performance degradation.
Remnant Memorization Accuracy quantifies privacy risks and validates unlearning effectiveness.
Abstract
Although language models (LMs) demonstrate exceptional capabilities on various tasks, they are potentially vulnerable to extraction attacks, which represent a significant privacy risk. To mitigate the privacy concerns of LMs, machine unlearning has emerged as an important research area, which is utilized to induce the LM to selectively forget about some of its training data. While completely retraining the model will guarantee successful unlearning and privacy assurance, it is impractical for LMs, as it would be time-consuming and resource-intensive. Prior works efficiently unlearn the target token sequences, but upon subsequent iterations, the LM displays significant degradation in performance. In this work, we propose Privacy Protection via Optimal Parameters (POP), a novel unlearning method that effectively forgets the target token sequences from the pretrained LM by applying optimal…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data
