A Unified Framework for Quantifying Privacy Risk in Synthetic Data
Matteo Giomi, Franziska Boenisch, Christoph Wehmeyer, Borb\'ala, Tasn\'adi

TL;DR
This paper introduces Anonymeter, a comprehensive framework for quantifying privacy risks in synthetic data, including attack-based evaluations aligned with GDPR, and demonstrates its effectiveness through extensive experiments.
Contribution
We present the first coherent, legally aligned evaluation framework for privacy risks in synthetic data, including novel attack models for singling out and linkability.
Findings
Privacy risks scale linearly with leakage levels.
Synthetic data shows low vulnerability to linkability attacks.
Anonymeter outperforms existing methods in detection and speed.
Abstract
Synthetic data is often presented as a method for sharing sensitive information in a privacy-preserving manner by reproducing the global statistical properties of the original data without disclosing sensitive information about any individual. In practice, as with other anonymization methods, privacy risks cannot be entirely eliminated. The residual privacy risks need instead to be ex-post assessed. We present Anonymeter, a statistical framework to jointly quantify different types of privacy risks in synthetic tabular datasets. We equip this framework with attack-based evaluations for the singling out, linkability, and inference risks, the three key indicators of factual anonymization according to the European General Data Protection Regulation (GDPR). To the best of our knowledge, we are the first to introduce a coherent and legally aligned evaluation of these three privacy risks for…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · Privacy, Security, and Data Protection · Data Quality and Management
