Resolving the Complexity of Some Data Privacy Problems

Jeremiah Blocki; Ryan Williams

arXiv:1004.3811·cs.CC·April 26, 2010·5 cites

Resolving the Complexity of Some Data Privacy Problems

Jeremiah Blocki, Ryan Williams

PDF

Open Access

TL;DR

This paper analyzes the computational complexity of data privacy methods like k-anonymity and l-diversity, providing both hardness results and efficient algorithms for specific cases, advancing understanding of their practical applicability.

Contribution

It offers new complexity classifications for k-anonymity and l-diversity, including polynomial-time algorithms for certain scenarios and NP-hardness proofs for others.

Findings

01

2-anonymity is in P.

02

3-anonymity with 27 attributes is MAX SNP-hard.

03

k-anonymity is NP-hard in general.

Abstract

We formally study two methods for data sanitation that have been used extensively in the database community: k-anonymity and l-diversity. We settle several open problems concerning the difficulty of applying these methods optimally, proving both positive and negative results: 1. 2-anonymity is in P. 2. The problem of partitioning the edges of a triangle-free graph into 4-stars (degree-three vertices) is NP-hard. This yields an alternative proof that 3-anonymity is NP-hard even when the database attributes are all binary. 3. 3-anonymity with only 27 attributes per record is MAX SNP-hard. 4. For databases with n rows, k-anonymity is in O(4^n poly(n)) time for all k > 1. 5. For databases with n rows and l <= log_{2c+2} log n attributes over an alphabet of cardinality c = O(1), k-anonymity is in P. Assuming c, l = O(1), k-anonymity is in O(n). 6. 3-diversity with binary…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsPrivacy-Preserving Technologies in Data · Cryptography and Data Security · Internet Traffic Analysis and Secure E-voting