Local Differentially Private Fuzzy Counting in Stream Data using Probabilistic Data Structure
Dinusha Vatsalan, Raghav Bhaskar, and Mohamed Ali Kaafar

TL;DR
This paper introduces a novel local differential privacy mechanism for streaming data that enables fuzzy counting of similar items with high utility, improved efficiency, and formal privacy guarantees, suitable for real-time applications.
Contribution
It presents a new privacy-preserving fuzzy counting method for streaming data that outperforms prior approaches in utility and query efficiency under similar privacy budgets.
Findings
Significantly higher accuracy in count estimation.
Reduced response time for real-time queries.
Effective privacy guarantees with formal proofs.
Abstract
Privacy-preserving estimation of counts of items in streaming data finds applications in several real-world scenarios including word auto-correction and traffic management applications. Recent works of RAPPOR and Apple's count-mean sketch (CMS) algorithm propose privacy preserving mechanisms for count estimation in large volumes of data using probabilistic data structures like counting Bloom filter and CMS. However, these existing methods fall short in providing a sound solution for real-time streaming data applications. In this work, we propose a novel (local) Differentially private mechanism that provides high utility for the streaming data count estimation problem with similar or even lower privacy budgets while providing: a) fuzzy counting to report counts of related or similar items (for instance to account for typing errors and data variations), and b) improved querying efficiency…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
