The Hardness and Approximation Algorithms for L-Diversity

Xiaokui Xiao; Ke Yi; Yufei Tao

arXiv:0912.5426·cs.DB·December 31, 2009

The Hardness and Approximation Algorithms for L-Diversity

Xiaokui Xiao, Ke Yi, Yufei Tao

PDF

Open Access

TL;DR

This paper provides the first theoretical analysis of l-diversity, proving NP-hardness of optimal solutions and introducing an approximation algorithm with practical effectiveness validated by experiments.

Contribution

It is the first to study the theoretical complexity of l-diversity and offers a novel approximation algorithm with proven bounds.

Findings

01

Optimal l-diverse generalization is NP-hard.

02

An (l*d)-approximation algorithm is proposed.

03

Experimental results validate the algorithm's effectiveness.

Abstract

The existing solutions to privacy preserving publication can be classified into the theoretical and heuristic categories. The former guarantees provably low information loss, whereas the latter incurs gigantic loss in the worst case, but is shown empirically to perform well on many real inputs. While numerous heuristic algorithms have been developed to satisfy advanced privacy principles such as l-diversity, t-closeness, etc., the theoretical category is currently limited to k-anonymity which is the earliest principle known to have severe vulnerability to privacy attacks. Motivated by this, we present the first theoretical study on l-diversity, a popular principle that is widely adopted in the literature. First, we show that optimal l-diverse generalization is NP-hard even when there are only 3 distinct sensitive values in the microdata. Then, an (l*d)-approximation algorithm is…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsPrivacy-Preserving Technologies in Data · Cryptography and Data Security · Complexity and Algorithms in Graphs