TMI! Finetuned Models Leak Private Information from their Pretraining   Data

John Abascal; Stanley Wu; Alina Oprea; Jonathan Ullman

arXiv:2306.01181·cs.LG·October 18, 2024·1 cites

TMI! Finetuned Models Leak Private Information from their Pretraining Data

John Abascal, Stanley Wu, Alina Oprea, Jonathan Ullman

PDF

Open Access 1 Repo

TL;DR

This paper introduces TMI, a novel attack that can infer whether specific data was part of a model's pretraining set by analyzing a finetuned model, revealing privacy risks in transfer learning.

Contribution

The paper proposes a new membership inference attack, TMI, that exploits memorization in finetuned models to reveal pretraining data membership, highlighting privacy vulnerabilities.

Findings

01

TMI successfully infers pretraining data membership across vision and language tasks.

02

The attack remains effective even with differential privacy during finetuning.

03

Open-source implementation is available for reproducibility.

Abstract

Transfer learning has become an increasingly popular technique in machine learning as a way to leverage a pretrained model trained for one task to assist with building a finetuned model for a related task. This paradigm has been especially popular for $privacy$ in machine learning, where the pretrained model is considered public, and only the data for finetuning is considered sensitive. However, there are reasons to believe that the data used for pretraining is still sensitive, making it essential to understand how much information the finetuned model leaks about the pretraining data. In this work we propose a new membership-inference threat model where the adversary only has access to the finetuned model and would like to infer the membership of the pretraining data. To realize this threat model, we implement a novel metaclassifier-based attack, $TMI$ , that leverages…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

johnmath/tmi-pets24
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsPrivacy-Preserving Technologies in Data · Adversarial Robustness in Machine Learning · Topic Modeling