The Boundary Between Privacy and Utility in Data Anonymization
Vibhor Rastogi, Dan Suciu, Sungho Hong

TL;DR
This paper explores the balance between privacy and utility in data anonymization, proposing a new practical definition and an algorithm that achieves both under certain probabilistic conditions.
Contribution
It introduces a novel privacy-utility framework based on probability relations and presents a new anonymization algorithm using random deletions and insertions.
Findings
No useful anonymization exists when Pr(t) = Omega(n/sqrt(m)).
A concrete anonymization algorithm is effective when Pr(t) = O(n/m).
The proposed method differs from k-anonymization and offers practical privacy-utility trade-offs.
Abstract
We consider the privacy problem in data publishing: given a relation I containing sensitive information 'anonymize' it to obtain a view V such that, on one hand attackers cannot learn any sensitive information from V, and on the other hand legitimate users can use V to compute useful statistics on I. These are conflicting goals. We use a definition of privacy that is derived from existing ones in the literature, which relates the a priori probability of a given tuple t, Pr(t), with the a posteriori probability, Pr(t | V), and propose a novel and quite practical definition for utility. Our main result is the following. Denoting n the size of I and m the size of the domain from which I was drawn (i.e. n < m) then: when the a priori probability is Pr(t) = Omega(n/sqrt(m)) for some t, there exists no useful anonymization algorithm, while when Pr(t) = O(n/m) for all tuples t, then we give a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · Privacy, Security, and Data Protection · Cryptography and Data Security
