Label-Only Membership Inference Attacks
Christopher A. Choquette-Choo, Florian Tramer, Nicholas Carlini, and, Nicolas Papernot

TL;DR
This paper introduces label-only membership inference attacks that do not rely on confidence scores, instead using label robustness under perturbations to reveal training data membership, effectively bypassing existing defenses.
Contribution
The paper presents novel label-only attacks that match confidence-based methods in effectiveness and break confidence masking defenses, highlighting the need for more robust privacy protections.
Findings
Label-only attacks perform on par with confidence-based attacks.
Confidence masking defenses are ineffective against label-only attacks.
Differential privacy and strong L2 regularization are the only effective defenses.
Abstract
Membership inference attacks are one of the simplest forms of privacy leakage for machine learning models: given a data point and model, determine whether the point was used to train the model. Existing membership inference attacks exploit models' abnormal confidence when queried on their training data. These attacks do not apply if the adversary only gets access to models' predicted labels, without a confidence measure. In this paper, we introduce label-only membership inference attacks. Instead of relying on confidence scores, our attacks evaluate the robustness of a model's predicted labels under perturbations to obtain a fine-grained membership signal. These perturbations include common data augmentations or adversarial examples. We empirically show that our label-only membership inference attacks perform on par with prior attacks that required access to model confidences. We…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Privacy-Preserving Technologies in Data · Machine Learning and Algorithms
