Anonymization with Worst-Case Distribution-Based Background Knowledge

Raymond Chi-Wing Wong; Ada Wai-Chee Fu; Ke Wang; Yabo Xu; Jian Pei,; Philip S. Yu

arXiv:0909.1127·cs.DB·September 8, 2009

Anonymization with Worst-Case Distribution-Based Background Knowledge

Raymond Chi-Wing Wong, Ada Wai-Chee Fu, Ke Wang, Yabo Xu, Jian Pei,, Philip S. Yu

PDF

Open Access

TL;DR

This paper introduces a new anonymization algorithm that considers worst-case distribution-based background knowledge to enhance privacy protection while maintaining data utility.

Contribution

It is the first to address distribution-based background knowledge in the worst-case scenario for data anonymization.

Findings

01

The proposed algorithm effectively protects individual privacy.

02

The method preserves high data utility.

03

Empirical results demonstrate robustness against worst-case background knowledge.

Abstract

Background knowledge is an important factor in privacy preserving data publishing. Distribution-based background knowledge is one of the well studied background knowledge. However, to the best of our knowledge, there is no existing work considering the distribution-based background knowledge in the worst case scenario, by which we mean that the adversary has accurate knowledge about the distribution of sensitive values according to some tuple attributes. Considering this worst case scenario is essential because we cannot overlook any breaching possibility. In this paper, we propose an algorithm to anonymize dataset in order to protect individual privacy by considering this background knowledge. We prove that the anonymized datasets generated by our proposed algorithm protects individual privacy. Our empirical studies show that our method preserves high utility for the published data at…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsPrivacy-Preserving Technologies in Data · Cryptography and Data Security · Data Quality and Management