Utility-Preserving Differentially Private Data Releases Via Individual Ranking Microaggregation

David Sánchez; Josep Domingo-Ferrer; Sergio Martínez; Jordi Soria-Comas

arXiv:1512.02897·cs.CR·December 17, 2015

TL;DR

This paper introduces a microaggregation-based method that reduces the noise needed for differentially private data releases and so preserves more of the data's utility, especially for large datasets with multiple attributes.

Contribution

It proposes a novel approach combining microaggregation with differential privacy to improve data utility without depending on dataset size.
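To make the combination concrete, here is a minimal Python sketch of the general idea: each attribute is microaggregated on its own by rank (individual ranking), and Laplace noise is then added to the resulting centroids. This is an illustration under stated assumptions, not the authors' algorithm. The function names, the handling of a short final group, the choice to add independent noise per record, and the `sensitivity` value passed in are all hypothetical. The sketch makes no claim that a given `sensitivity` yields a valid differential privacy guarantee; that calibration has to come from the paper's analysis.

```python
import random


def individual_ranking_microaggregate(values, k):
    """Replace each value by the mean of its group of rank-adjacent values.

    Values are sorted, split into consecutive groups of k, and a leftover
    tail shorter than k is folded into the last group (an assumption made
    here so that every group has at least k members).
    """
    n = len(values)
    if not 1 <= k <= n:
        raise ValueError("k must satisfy 1 <= k <= len(values)")
    order = sorted(range(n), key=lambda i: values[i])
    result = [0.0] * n
    start = 0
    while start < n:
        # If fewer than k records would remain after this group, absorb them.
        end = n if n - (start + k) < k else start + k
        group = order[start:end]
        mean = sum(values[i] for i in group) / len(group)
        for i in group:
            result[i] = mean
        start = end
    return result


def laplace_noise(scale, rng):
    """Draw Laplace(0, scale) noise as the difference of two exponentials."""
    return rng.expovariate(1.0 / scale) - rng.expovariate(1.0 / scale)


def release_attribute(values, k, sensitivity, epsilon, rng):
    """Microaggregate one attribute, then perturb it with Laplace noise.

    ASSUMPTION: `sensitivity` is supplied by the caller; this sketch does
    not derive it. The scale sensitivity / epsilon is the standard Laplace
    mechanism calibration.
    """
    centroids = individual_ranking_microaggregate(values, k)
    scale = sensitivity / epsilon
    # ASSUMPTION: independent noise is drawn for every record.
    return [c + laplace_noise(scale, rng) for c in centroids]


ages = [5, 1, 9, 3, 7, 2, 8]
centroids = individual_ranking_microaggregate(ages, k=3)
print(centroids)  # [7.25, 2.0, 7.25, 2.0, 7.25, 2.0, 7.25]

rng = random.Random(0)
noisy = release_attribute(ages, k=3, sensitivity=1.0, epsilon=1.0, rng=rng)
print(noisy)
```

For a dataset with several attributes, `release_attribute` would be applied to each column separately, which is what distinguishes individual ranking from multivariate microaggregation. Larger groups average more records into each centroid, which is the intuition behind needing less noise.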

Findings

01 Reduced noise in differentially private outputs

02 Improved data utility for large datasets

03 Empirical validation across multiple datasets

Abstract

Being able to release and exploit open data gathered in information systems is crucial for researchers, enterprises and the overall society. Yet, these data must be anonymized before release to protect the privacy of the subjects to whom the records relate. Differential privacy is a privacy model for anonymization that offers more robust privacy guarantees than previous models, such as $k$-anonymity and its extensions. However, it is often disregarded that the utility of differentially private outputs is quite limited, either because of the amount of noise that needs to be added to obtain them or because utility is only preserved for a restricted type and/or a limited number of queries. On the contrary, $k$-anonymity-like data releases make no assumptions on the uses of the protected data and, thus, do not restrict the number and type of doable analyses. Recently, some authors have…
