Parameterized Complexity of the k-anonymity Problem
Stefano Beretta, Paola Bonizzoni, Gianluca Della Vedova, Riccardo, Dondi, Yuri Pirola

TL;DR
This paper explores the computational complexity of the k-anonymity problem, revealing its hardness under various parameters and providing fixed parameter algorithms for specific cases.
Contribution
It demonstrates the problem's W[1]-hardness when parameterized by solution size, and offers a fixed parameter algorithm based on alphabet size and columns.
Findings
k-anonymity is W[1]-hard when parameterized by solution size
A fixed parameter algorithm exists for alphabet size and number of columns
The problem remains APX-hard even with length-bounded records and k=3
Abstract
The problem of publishing personal data without giving up privacy is becoming increasingly important. An interesting formalization that has been recently proposed is the -anonymity. This approach requires that the rows of a table are partitioned in clusters of size at least and that all the rows in a cluster become the same tuple, after the suppression of some entries. The natural optimization problem, where the goal is to minimize the number of suppressed entries, is known to be APX-hard even when the records values are over a binary alphabet and , and when the records have length at most 8 and . In this paper we study how the complexity of the problem is influenced by different parameters. In this paper we follow this direction of research, first showing that the problem is W[1]-hard when parameterized by the size of the solution (and the value ). Then we exhibit…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
