Discrete Distribution Estimation with Local Differential Privacy: A Comparative Analysis
Ba Dung Le, Tanveer Zia

TL;DR
This paper compares various algorithms for estimating discrete data distributions under local differential privacy, analyzing their accuracy across different privacy levels and datasets.
Contribution
It provides a comprehensive comparative analysis of existing algorithms for discrete distribution estimation under local differential privacy, highlighting their relative performance.
Findings
Basic RAPPOR performs best in high privacy regimes.
k-RR algorithm excels in low privacy regimes.
k-RR, k-subset, and HR algorithms are competitive in medium privacy regimes.
Abstract
Local differential privacy is a promising privacy-preserving model for statistical aggregation of user data that prevents user privacy leakage from the data aggregator. This paper focuses on the problem of estimating the distribution of discrete user values with Local differential privacy. We review and present a comparative analysis on the performance of the existing discrete distribution estimation algorithms in terms of their accuracy on benchmark datasets. Our evaluation benchmarks include real-world and synthetic datasets of categorical individual values with the number of individuals from hundreds to millions and the domain size up to a few hundreds of values. The experimental results show that the Basic RAPPOR algorithm generally performs best for the benchmark datasets in the high privacy regime while the k-RR algorithm often gives the best estimation in the low privacy regime.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
