Privacy Preserving Count Statistics

Lu Yu; Oluwakemi Hambolu; Yu Fu; Jon Oakley; Richard R.; Brooks

arXiv:1910.07020·cs.CR·October 17, 2019

Privacy Preserving Count Statistics

Lu Yu, Oluwakemi Hambolu, Yu Fu, Jon Oakley, Richard R., Brooks

PDF

Open Access

TL;DR

This paper explores using probabilistic counting as an anonymization method to estimate unique user counts while preserving privacy, including collision handling and a new anonymity metric.

Contribution

It extends probabilistic counting techniques for privacy preservation by incorporating collision adjustments and developing a quantitative anonymity measure.

Findings

01

Collision compensation improves estimate accuracy.

02

Proper register size ensures reliable estimates.

03

New anonymity metric quantifies privacy levels.

Abstract

The ability to preserve user privacy and anonymity is important. One of the safest ways to maintain privacy is to avoid storing personally identifiable information (PII), which poses a challenge for maintaining useful user statistics. Probabilistic counting has been used to find the cardinality of a multiset when precise counting is too resource intensive. In this paper, probabilistic counting is used as an anonymization technique that provides a reliable estimate of the number of unique users. We extend previous work in probabilistic counting by considering its use for preserving user anonymity, developing application guidelines and including hash collisions in the estimate. Our work complements previous method by attempting to explore the causes of the deviation of uncorrected estimate from the real value. The experimental results show that if the proper register size is used,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsPrivacy-Preserving Technologies in Data · Cryptography and Data Security · Privacy, Security, and Data Protection