Privacy Preserving Count Statistics
Lu Yu, Oluwakemi Hambolu, Yu Fu, Jon Oakley, Richard R., Brooks

TL;DR
This paper explores using probabilistic counting as an anonymization method to estimate unique user counts while preserving privacy, including collision handling and a new anonymity metric.
Contribution
It extends probabilistic counting techniques for privacy preservation by incorporating collision adjustments and developing a quantitative anonymity measure.
Findings
Collision compensation improves estimate accuracy.
Proper register size ensures reliable estimates.
New anonymity metric quantifies privacy levels.
Abstract
The ability to preserve user privacy and anonymity is important. One of the safest ways to maintain privacy is to avoid storing personally identifiable information (PII), which poses a challenge for maintaining useful user statistics. Probabilistic counting has been used to find the cardinality of a multiset when precise counting is too resource intensive. In this paper, probabilistic counting is used as an anonymization technique that provides a reliable estimate of the number of unique users. We extend previous work in probabilistic counting by considering its use for preserving user anonymity, developing application guidelines and including hash collisions in the estimate. Our work complements previous method by attempting to explore the causes of the deviation of uncorrected estimate from the real value. The experimental results show that if the proper register size is used,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · Cryptography and Data Security · Privacy, Security, and Data Protection
