Differentially Private Synthetic Data Release for Topics API Outputs

Travis Dick; Alessandro Epasto; Adel Javanmard; Josh Karlin; Andres Munoz Medina; Vahab Mirrokni; Sergei Vassilvitskii; Peilin Zhong

arXiv:2506.23855·cs.CR·July 1, 2025

Differentially Private Synthetic Data Release for Topics API Outputs

Travis Dick, Alessandro Epasto, Adel Javanmard, Josh Karlin, Andres Munoz Medina, Vahab Mirrokni, Sergei Vassilvitskii, Peilin Zhong

PDF

TL;DR

This paper introduces a novel method to generate realistic, differentially private synthetic outputs for the Topics API, enabling empirical privacy analysis while protecting user data.

Contribution

The authors develop a new approach to create synthetic API data that preserves privacy and matches real data properties, facilitating transparency and research.

Findings

01

Synthetic data closely matches real API output statistics

02

Differential privacy bounds information leakage effectively

03

Open-source dataset enables further research

Abstract

The analysis of the privacy properties of Privacy-Preserving Ads APIs is an area of research that has received strong interest from academics, industry, and regulators. Despite this interest, the empirical study of these methods is hindered by the lack of publicly available data. Reliable empirical analysis of the privacy properties of an API, in fact, requires access to a dataset consisting of realistic API outputs; however, privacy concerns prevent the general release of such data to the public. In this work, we develop a novel methodology to construct synthetic API outputs that are simultaneously realistic enough to enable accurate study and provide strong privacy protections. We focus on one Privacy-Preserving Ads APIs: the Topics API, part of Google Chrome's Privacy Sandbox. We developed a methodology to generate a differentially-private dataset that closely matches the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.