A Method to Facilitate Membership Inference Attacks in Deep Learning   Models

Zitao Chen; Karthik Pattabiraman

arXiv:2407.01919·cs.CR·July 3, 2024

A Method to Facilitate Membership Inference Attacks in Deep Learning Models

Zitao Chen, Karthik Pattabiraman

PDF

Open Access 1 Repo

TL;DR

This paper introduces a new, more powerful membership inference attack in black-box settings that can de-identify training data with high accuracy while preserving model performance, revealing privacy vulnerabilities in ML models.

Contribution

The authors propose a novel membership inference attack that surpasses previous methods in effectiveness and demonstrates limitations of current privacy auditing techniques.

Findings

01

Achieves over 99% true positive rate at 0.1% false positive rate in membership inference

02

Maintains less than 1% accuracy drop in models after attack

03

Reveals flaws in existing membership privacy auditing methods

Abstract

Modern machine learning (ML) ecosystems offer a surging number of ML frameworks and code repositories that can greatly facilitate the development of ML models. Today, even ordinary data holders who are not ML experts can apply off-the-shelf codebase to build high-performance ML models on their data, many of which are sensitive in nature (e.g., clinical records). In this work, we consider a malicious ML provider who supplies model-training code to the data holders, does not have access to the training process, and has only black-box query access to the resulting model. In this setting, we demonstrate a new form of membership inference attack that is strictly more powerful than prior art. Our attack empowers the adversary to reliably de-identify all the training samples (average >99% attack [email protected]% FPR), and the compromised models still maintain competitive performance as their…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

DependableSystemsLab/code_poison_MIA
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning

MethodsSparse Evolutionary Training