Privacy Vulnerabilities of Dataset Anonymization Techniques
Eyal Nussbaum, Michael Segal

TL;DR
This paper investigates privacy vulnerabilities in common dataset anonymization techniques, specifically data perturbation and query-set-size control, revealing potential attacks and limitations in preserving user privacy.
Contribution
It analyzes the weaknesses of two anonymization methods, including a new attack on NeNDS and the conditions under which privacy can still be compromised.
Findings
NeNDS data perturbation can be partially attacked with prior knowledge.
Query-set-size control can leak information through certain query types.
Some query types under query-set-size control maintain privacy even with multiple queries.
Abstract
Vast amounts of information of all types are collected daily about people by governments, corporations and individuals. The information is collected when users register to or use on-line applications, receive health related services, use their mobile phones, utilize search engines, or perform common daily activities. As a result, there is an enormous quantity of privately-owned records that describe individuals' finances, interests, activities, and demographics. These records often include sensitive data and may violate the privacy of the users if published. The common approach to safeguarding user information, or data in general, is to limit access to the storage (usually a database) by using and authentication and authorization protocol. This way, only users with legitimate permissions can access the user data. In many cases though, the publication of user data for statistical…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · Data Quality and Management · Cryptography and Data Security
