SecretGen: Privacy Recovery on Pre-Trained Models via Distribution   Discrimination

Zhuowen Yuan; Fan Wu; Yunhui Long; Chaowei Xiao; Bo Li

arXiv:2207.12263·cs.LG·July 26, 2022

SecretGen: Privacy Recovery on Pre-Trained Models via Distribution Discrimination

Zhuowen Yuan, Fan Wu, Yunhui Long, Chaowei Xiao, Bo Li

PDF

Open Access 1 Repo

TL;DR

This paper introduces SecretGen, a novel framework for recovering private training data from pre-trained models without prior knowledge, highlighting privacy vulnerabilities and providing benchmarks for privacy assessment.

Contribution

SecretGen is a new method that effectively reconstructs private data from pre-trained models without needing true class predictions, advancing privacy attack techniques.

Findings

01

SecretGen can recover private data without prior true class information.

02

It outperforms baseline methods when prior knowledge is available.

03

The study provides metrics to quantify privacy risks in pre-trained models.

Abstract

Transfer learning through the use of pre-trained models has become a growing trend for the machine learning community. Consequently, numerous pre-trained models are released online to facilitate further research. However, it raises extensive concerns on whether these pre-trained models would leak privacy-sensitive information of their training data. Thus, in this work, we aim to answer the following questions: "Can we effectively recover private information from these pre-trained models? What are the sufficient conditions to retrieve such sensitive information?" We first explore different statistical information which can discriminate the private training distribution from other distributions. Based on our observations, we propose a novel private data reconstruction framework, SecretGen, to effectively recover private information. Compared with previous methods which can recover private…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

ai-secure/secretgen
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsPrivacy-Preserving Technologies in Data · Adversarial Robustness in Machine Learning