Understanding Membership Inferences on Well-Generalized Learning Models
Yunhui Long, Vincent Bindschaedler, Lei Wang, Diyue Bu, Xiaofeng Wang,, Haixu Tang, Carl A. Gunter, and Kai Chen

TL;DR
This paper investigates membership inference attacks on well-generalized models, revealing vulnerabilities even without overfitting, and introduces novel techniques to identify and exploit these weak points.
Contribution
It demonstrates that well-generalized models can still be vulnerable to membership inference attacks using new instance selection and detection methods.
Findings
Overfitting is not necessary for successful MIA.
Novel techniques can identify vulnerable instances with high precision.
Existing generalization methods are less effective against these vulnerabilities.
Abstract
Membership Inference Attack (MIA) determines the presence of a record in a machine learning model's training data by querying the model. Prior work has shown that the attack is feasible when the model is overfitted to its training data or when the adversary controls the training algorithm. However, when the model is not overfitted and the adversary does not control the training algorithm, the threat is not well understood. In this paper, we report a study that discovers overfitting to be a sufficient but not a necessary condition for an MIA to succeed. More specifically, we demonstrate that even a well-generalized model contains vulnerable instances subject to a new generalized MIA (GMIA). In GMIA, we use novel techniques for selecting vulnerable instances and detecting their subtle influences ignored by overfitting metrics. Specifically, we successfully identify individual records with…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Privacy-Preserving Technologies in Data · Domain Adaptation and Few-Shot Learning
