A Method to Facilitate Membership Inference Attacks in Deep Learning Models
Zitao Chen, Karthik Pattabiraman

TL;DR
This paper introduces a new, more powerful membership inference attack in black-box settings that can de-identify training data with high accuracy while preserving model performance, revealing privacy vulnerabilities in ML models.
Contribution
The authors propose a novel membership inference attack that surpasses previous methods in effectiveness and demonstrates limitations of current privacy auditing techniques.
Findings
Achieves over 99% true positive rate at 0.1% false positive rate in membership inference
Maintains less than 1% accuracy drop in models after attack
Reveals flaws in existing membership privacy auditing methods
Abstract
Modern machine learning (ML) ecosystems offer a surging number of ML frameworks and code repositories that can greatly facilitate the development of ML models. Today, even ordinary data holders who are not ML experts can apply off-the-shelf codebase to build high-performance ML models on their data, many of which are sensitive in nature (e.g., clinical records). In this work, we consider a malicious ML provider who supplies model-training code to the data holders, does not have access to the training process, and has only black-box query access to the resulting model. In this setting, we demonstrate a new form of membership inference attack that is strictly more powerful than prior art. Our attack empowers the adversary to reliably de-identify all the training samples (average >99% attack [email protected]% FPR), and the compromised models still maintain competitive performance as their…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning
MethodsSparse Evolutionary Training
