Impact of Dataset Properties on Membership Inference Vulnerability of Deep Transfer Learning

Marlon Tobaben; Hibiki Ito; Joonas J\"alk\"o; Yuan He; and Antti Honkela

arXiv:2402.06674·cs.CR·February 3, 2026·1 cites

Impact of Dataset Properties on Membership Inference Vulnerability of Deep Transfer Learning

Marlon Tobaben, Hibiki Ito, Joonas J\"alk\"o, Yuan He, and Antti Honkela

PDF

Open Access 1 Video

TL;DR

This paper investigates how dataset size and properties influence the susceptibility of fine-tuned neural networks to membership inference attacks, revealing a power-law relationship and the impact of dataset size on privacy protection.

Contribution

It provides both empirical and theoretical analysis of MIA vulnerability in transfer learning, highlighting the effect of dataset properties on privacy risks.

Findings

01

Vulnerability decreases with more examples per class following a power law.

02

Large dataset sizes are required to protect the most vulnerable points.

03

Empirical and theoretical models align in describing MIA vulnerability.

Abstract

Membership inference attacks (MIAs) are used to test practical privacy of machine learning models. MIAs complement formal guarantees from differential privacy (DP) under a more realistic adversary model. We analyse MIA vulnerability of fine-tuned neural networks both empirically and theoretically, the latter using a simplified model of fine-tuning. We show that the vulnerability of non-DP models when measured as the attacker advantage at a fixed false positive rate reduces according to a simple power law as the number of examples per class increases. A similar power-law applies even for the most vulnerable points, but the dataset size needed for adequate protection of the most vulnerable points is very large.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Impact of Dataset Properties on Membership Inference Vulnerability of Deep Transfer Learning· slideslive

Taxonomy

TopicsPrivacy-Preserving Technologies in Data

MethodsSparse Evolutionary Training · Focus