Utility-Preserving Differentially Private Data Releases Via Individual Ranking Microaggregation
David S\'anchez, Josep Domingo-Ferrer, Sergio Mart\'inez, Jordi, Soria-Comas

TL;DR
This paper introduces a microaggregation-based method to enhance the utility of differentially private data releases, reducing noise and preserving data usefulness, especially for large datasets with multiple attributes.
Contribution
It proposes a novel approach combining microaggregation with differential privacy to improve data utility without depending on dataset size.
Findings
Reduced noise in differentially private outputs
Improved data utility for large datasets
Empirical validation across multiple datasets
Abstract
Being able to release and exploit open data gathered in information systems is crucial for researchers, enterprises and the overall society. Yet, these data must be anonymized before release to protect the privacy of the subjects to whom the records relate. Differential privacy is a privacy model for anonymization that offers more robust privacy guarantees than previous models, such as -anonymity and its extensions. However, it is often disregarded that the utility of differentially private outputs is quite limited, either because of the amount of noise that needs to be added to obtain them or because utility is only preserved for a restricted type and/or a limited number of queries. On the contrary, -anonymity-like data releases make no assumptions on the uses of the protected data and, thus, do not restrict the number and type of doable analyses. Recently, some authors have…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
