TMI! Finetuned Models Leak Private Information from their Pretraining Data
John Abascal, Stanley Wu, Alina Oprea, Jonathan Ullman

TL;DR
This paper introduces TMI, a novel attack that can infer whether specific data was part of a model's pretraining set by analyzing a finetuned model, revealing privacy risks in transfer learning.
Contribution
The paper proposes a new membership inference attack, TMI, that exploits memorization in finetuned models to reveal pretraining data membership, highlighting privacy vulnerabilities.
Findings
TMI successfully infers pretraining data membership across vision and language tasks.
The attack remains effective even with differential privacy during finetuning.
Open-source implementation is available for reproducibility.
Abstract
Transfer learning has become an increasingly popular technique in machine learning as a way to leverage a pretrained model trained for one task to assist with building a finetuned model for a related task. This paradigm has been especially popular for in machine learning, where the pretrained model is considered public, and only the data for finetuning is considered sensitive. However, there are reasons to believe that the data used for pretraining is still sensitive, making it essential to understand how much information the finetuned model leaks about the pretraining data. In this work we propose a new membership-inference threat model where the adversary only has access to the finetuned model and would like to infer the membership of the pretraining data. To realize this threat model, we implement a novel metaclassifier-based attack, , that leverages…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · Adversarial Robustness in Machine Learning · Topic Modeling
