SecretGen: Privacy Recovery on Pre-Trained Models via Distribution Discrimination
Zhuowen Yuan, Fan Wu, Yunhui Long, Chaowei Xiao, Bo Li

TL;DR
This paper introduces SecretGen, a novel framework for recovering private training data from pre-trained models without prior knowledge, highlighting privacy vulnerabilities and providing benchmarks for privacy assessment.
Contribution
SecretGen is a new method that effectively reconstructs private data from pre-trained models without needing true class predictions, advancing privacy attack techniques.
Findings
SecretGen can recover private data without prior true class information.
It outperforms baseline methods when prior knowledge is available.
The study provides metrics to quantify privacy risks in pre-trained models.
Abstract
Transfer learning through the use of pre-trained models has become a growing trend for the machine learning community. Consequently, numerous pre-trained models are released online to facilitate further research. However, it raises extensive concerns on whether these pre-trained models would leak privacy-sensitive information of their training data. Thus, in this work, we aim to answer the following questions: "Can we effectively recover private information from these pre-trained models? What are the sufficient conditions to retrieve such sensitive information?" We first explore different statistical information which can discriminate the private training distribution from other distributions. Based on our observations, we propose a novel private data reconstruction framework, SecretGen, to effectively recover private information. Compared with previous methods which can recover private…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · Adversarial Robustness in Machine Learning
