Survey on Privacy-Preserving Techniques for Data Publishing
T\^ania Carvalho, Nuno Moniz, Pedro Faria, Lu\'is Antunes

TL;DR
This survey reviews privacy-preserving techniques for microdata de-identification, discussing challenges, approaches, and measures to balance privacy protection with data utility in response to growing privacy concerns.
Contribution
It provides a comprehensive overview of existing techniques, taxonomies, and theoretical analyses of privacy-preserving methods in data publishing.
Findings
Identifies key challenges in balancing privacy and data utility.
Reviews various privacy measures and their effectiveness.
Highlights open issues and future research directions.
Abstract
The exponential growth of collected, processed, and shared microdata has given rise to concerns about individuals' privacy. As a result, laws and regulations have emerged to control what organisations do with microdata and how they protect it. Statistical Disclosure Control seeks to reduce the risk of confidential information disclosure by de-identifying them. Such de-identification is guaranteed through privacy-preserving techniques. However, de-identified data usually results in loss of information, with a possible impact on data analysis precision and model predictive performance. The main goal is to protect the individuals' privacy while maintaining the interpretability of the data, i.e. its usefulness. Statistical Disclosure Control is an area that is expanding and needs to be explored since there is still no solution that guarantees optimal privacy and utility. This survey focuses…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · Privacy, Security, and Data Protection
