Membership Inference Attacks Beyond Overfitting
Mona Khalil, Alberto Blanco-Justicia, Najeeb Jebreel, Josep Domingo-Ferrer

TL;DR
This paper investigates why machine learning models leak training data information beyond overfitting, identifying outliers as vulnerable samples and proposing targeted defenses to improve privacy protections.
Contribution
It reveals that non-overfitted models can still leak information through outliers and suggests defenses specifically targeting these vulnerable data points.
Findings
Outliers within classes are more vulnerable to membership inference attacks.
Non-overfitted models can leak training data information.
Targeted defenses can mitigate vulnerabilities of outlier samples.
Abstract
Membership inference attacks (MIAs) against machine learning (ML) models aim to determine whether a given data point was part of the model training data. These attacks may pose significant privacy risks to individuals whose sensitive data were used for training, which motivates the use of defenses such as differential privacy, often at the cost of high accuracy losses. MIAs exploit the differences in the behavior of a model when making predictions on samples it has seen during training (members) versus those it has not seen (non-members). Several studies have pointed out that model overfitting is the major factor contributing to these differences in behavior and, consequently, to the success of MIAs. However, the literature also shows that even non-overfitted ML models can leak information about a small subset of their training data. In this paper, we investigate the root causes of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Privacy-Preserving Technologies in Data · Explainable Artificial Intelligence (XAI)
