The Privacy-Utility Trade-off in the Topics API
M\'ario S. Alvim, Natasha Fernandes, Annabelle McIver, Gabriel H., Nunes

TL;DR
This paper evaluates the privacy risks and utility benefits of Google's Topics API, a proposed privacy-preserving advertising tool, through theoretical analysis and real-world data experiments.
Contribution
It provides theoretical bounds on re-identification risks and utility, considering side information and differential privacy, validated by experiments.
Findings
Theoretical upper bounds on re-identification risk.
Analysis of utility for identifying popular topics.
Experimental validation with real-world data.
Abstract
The ongoing deprecation of third-party cookies by web browser vendors has sparked the proposal of alternative methods to support more privacy-preserving personalized advertising on web browsers and applications. The Topics API is being proposed by Google to provide third-parties with "coarse-grained advertising topics that the page visitor might currently be interested in". In this paper, we analyze the re-identification risks for individual Internet users and the utility provided to advertising companies by the Topics API, i.e. learning the most popular topics and distinguishing between real and random topics. We provide theoretical results dependent only on the API parameters that can be readily applied to evaluate the privacy and utility implications of future API updates, including novel general upper-bounds that account for adversaries with access to unknown, arbitrary side…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data
