Protecting Privacy Through Approximating Optimal Parameters for Sequence Unlearning in Language Models

Dohyun Lee; Daniel Rim; Minseok Choi; Jaegul Choo

arXiv:2406.14091·cs.CL·June 21, 2024

TL;DR

This paper introduces POP, a novel method for unlearning specific data from language models by approximating optimal parameters, effectively balancing privacy protection and model performance.

Contribution

The work proposes an unlearning technique that approximates the optimal gradient updates, strengthening privacy protection while preserving model accuracy better than existing methods do.

Findings

01. POP outperforms state-of-the-art unlearning methods across multiple benchmarks.
02. It effectively forgets target sequences with minimal performance degradation.
03. Remnant Memorization Accuracy quantifies privacy risks and validates unlearning effectiveness.
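
This listing does not define Remnant Memorization Accuracy, so the sketch below does not implement it. It illustrates the more general idea of a token-level memorization score: the fraction of tokens in a target sequence that a model reproduces when given the true prefix. The `predict_next` interface and the `make_lookup_model` toy model are hypothetical stand-ins for a real language model, introduced only for illustration.

```python
from typing import Callable, Dict, Sequence, Tuple

# Hypothetical interface (an assumption, not from the paper): a "model" is any
# function that maps a token prefix to its single most likely next token.
NextTokenFn = Callable[[Sequence[int]], int]


def memorization_accuracy(predict_next: NextTokenFn, tokens: Sequence[int]) -> float:
    """Fraction of positions where the greedy prediction matches the true next token."""
    if len(tokens) < 2:
        raise ValueError("need at least two tokens to score a sequence")
    hits = sum(
        1
        for t in range(1, len(tokens))
        if predict_next(tokens[:t]) == tokens[t]
    )
    return hits / (len(tokens) - 1)


def make_lookup_model(memorized: Sequence[int]) -> NextTokenFn:
    """Toy stand-in for an LM: replays one memorized sequence, returns -1 otherwise."""
    table: Dict[Tuple[int, ...], int] = {
        tuple(memorized[:t]): memorized[t] for t in range(1, len(memorized))
    }

    def predict_next(prefix: Sequence[int]) -> int:
        return table.get(tuple(prefix), -1)

    return predict_next


target = [5, 3, 8, 1]
before = make_lookup_model(target)        # model that fully memorized the target
after = make_lookup_model([5, 3, 9, 9])   # model that diverges after two tokens

score_before = memorization_accuracy(before, target)
score_after = memorization_accuracy(after, target)
```

Under this toy setup, a score near 1 means the sequence can be replayed almost verbatim, and a lower score after unlearning indicates that less of the target sequence is still reproduced. With a real LM, `predict_next` would wrap a greedy next-token prediction over the model's output distribution.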

Abstract

Although language models (LMs) demonstrate exceptional capabilities on various tasks, they are potentially vulnerable to extraction attacks, which represent a significant privacy risk. To mitigate the privacy concerns of LMs, machine unlearning has emerged as an important research area, which is utilized to induce the LM to selectively forget about some of its training data. While completely retraining the model will guarantee successful unlearning and privacy assurance, it is impractical for LMs, as it would be time-consuming and resource-intensive. Prior works efficiently unlearn the target token sequences, but upon subsequent iterations, the LM displays significant degradation in performance. In this work, we propose Privacy Protection via Optimal Parameters (POP), a novel unlearning method that effectively forgets the target token sequences from the pretrained LM by applying optimal…

Taxonomy

Topics: Privacy-Preserving Technologies in Data