Learn What You Want to Unlearn: Unlearning Inversion Attacks against   Machine Unlearning

Hongsheng Hu; Shuo Wang; Tian Dong; Minhui Xue

arXiv:2404.03233·cs.CR·April 5, 2024·2 cites

Learn What You Want to Unlearn: Unlearning Inversion Attacks against Machine Unlearning

Hongsheng Hu, Shuo Wang, Tian Dong, Minhui Xue

PDF

Open Access 1 Repo

TL;DR

This paper reveals privacy vulnerabilities in machine unlearning, demonstrating how adversaries can invert models to recover sensitive data, and discusses defenses that balance privacy and utility.

Contribution

It introduces the first unlearning inversion attacks, exposing privacy risks in machine unlearning and evaluating potential defenses against these attacks.

Findings

01

Unlearning inversion attacks can reveal sensitive data information.

02

The attacks are effective across various models and unlearning methods.

03

Defenses reduce attack success but also decrease model utility.

Abstract

Machine unlearning has become a promising solution for fulfilling the "right to be forgotten", under which individuals can request the deletion of their data from machine learning models. However, existing studies of machine unlearning mainly focus on the efficacy and efficiency of unlearning methods, while neglecting the investigation of the privacy vulnerability during the unlearning process. With two versions of a model available to an adversary, that is, the original model and the unlearned model, machine unlearning opens up a new attack surface. In this paper, we conduct the first investigation to understand the extent to which machine unlearning can leak the confidential content of the unlearned data. Specifically, under the Machine Learning as a Service setting, we propose unlearning inversion attacks that can reveal the feature and label information of an unlearned sample by…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

tasi-lab/unlearning-inversion-attacks
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning