Subset Privacy: Draw from an Obfuscated Urn

Ganghua Wang; Jie Ding

arXiv:2107.02013·cs.CR·July 6, 2021·1 cites

Subset Privacy: Draw from an Obfuscated Urn

Ganghua Wang, Jie Ding

PDF

Open Access

TL;DR

This paper introduces subset privacy, a new local privacy model that privatizes categorical data by replacing values with random subsets, enabling accurate distribution estimation and independence testing while providing privacy guarantees.

Contribution

It proposes the subset privacy mechanism, along with methods for distribution estimation and independence testing from subset-private data, with theoretical guarantees and practical evaluation.

Findings

01

Effective distribution estimation from subset-private data

02

Successful independence testing with privacy guarantees

03

Demonstrated performance on real-world datasets

Abstract

With the rapidly increasing ability to collect and analyze personal data, data privacy becomes an emerging concern. In this work, we develop a new statistical notion of local privacy to protect each categorical data that will be collected by untrusted entities. The proposed solution, named subset privacy, privatizes the original data value by replacing it with a random subset containing that value. We develop methods for the estimation of distribution functions and independence testing from subset-private data with theoretical guarantees. We also study different mechanisms to realize the subset privacy and evaluation metrics to quantify the amount of privacy in practice. Experimental results on both simulated and real-world datasets demonstrate the encouraging performance of the developed concepts and methods.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsPrivacy-Preserving Technologies in Data · Privacy, Security, and Data Protection · Cryptography and Data Security