Membership Inference Attacks Beyond Overfitting

Mona Khalil; Alberto Blanco-Justicia; Najeeb Jebreel; Josep Domingo-Ferrer

arXiv:2511.16792·cs.CR·November 24, 2025

Membership Inference Attacks Beyond Overfitting

Mona Khalil, Alberto Blanco-Justicia, Najeeb Jebreel, Josep Domingo-Ferrer

PDF

Open Access

TL;DR

This paper investigates why machine learning models leak training data information beyond overfitting, identifying outliers as vulnerable samples and proposing targeted defenses to improve privacy protections.

Contribution

It reveals that non-overfitted models can still leak information through outliers and suggests defenses specifically targeting these vulnerable data points.

Findings

01

Outliers within classes are more vulnerable to membership inference attacks.

02

Non-overfitted models can leak training data information.

03

Targeted defenses can mitigate vulnerabilities of outlier samples.

Abstract

Membership inference attacks (MIAs) against machine learning (ML) models aim to determine whether a given data point was part of the model training data. These attacks may pose significant privacy risks to individuals whose sensitive data were used for training, which motivates the use of defenses such as differential privacy, often at the cost of high accuracy losses. MIAs exploit the differences in the behavior of a model when making predictions on samples it has seen during training (members) versus those it has not seen (non-members). Several studies have pointed out that model overfitting is the major factor contributing to these differences in behavior and, consequently, to the success of MIAs. However, the literature also shows that even non-overfitted ML models can leak information about a small subset of their training data. In this paper, we investigate the root causes of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Privacy-Preserving Technologies in Data · Explainable Artificial Intelligence (XAI)