The Trade-off between Privacy and Fidelity via Ehrhart Theory
Arun Padakandla, P. R. Kumar, Wojciech Szpankowski

TL;DR
This paper explores the fundamental trade-off between data privacy and accuracy in database sanitization, using Ehrhart theory to precisely characterize the optimal utility-privacy balance in large datasets.
Contribution
It introduces a novel information-theoretic framework for analyzing privacy-utility trade-offs, employing Ehrhart theory to derive explicit asymptotic formulas for optimal mechanisms.
Findings
Characterizes the utility-privacy trade-off in large databases.
Provides a closed-form expression for the asymptotic growth of optimal privacy-preserving mechanisms.
Establishes a connection between distortion measures and Ehrhart series of convex polytopes.
Abstract
As an increasing amount of data is gathered nowadays and stored in databases (DBs), the question arises of how to protect the privacy of individual records in a DB even while providing accurate answers to queries on the DB. Differential Privacy (DP) has gained acceptance as a framework to quantify vulnerability of algorithms to privacy breaches. We consider the problem of how to sanitize an entire DB via a DP mechanism, on which unlimited further querying is performed. While protecting privacy, it is important that the sanitized DB still provide accurate responses to queries. The central contribution of this work is to characterize the amount of information preserved in an optimal DP DB sanitizing mechanism (DSM). We precisely characterize the utility-privacy trade-off of mechanisms that sanitize DBs in the asymptotic regime of large DBs. We study this in an information-theoretic…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
