Strict Optimality of Frequency and Distribution Estimation Under Local Differential Privacy
Mingen Pan

TL;DR
This paper proves the strict optimality of a linear estimator for frequency and distribution estimation under local differential privacy, achieving theoretical lower bounds with minimal communication cost and practical algorithms.
Contribution
It introduces a theoretically optimal linear estimator under LDP, with minimal communication, and demonstrates its practical effectiveness through a modified Count-Mean Sketch.
Findings
Achieves the $ ext{L}_2$ loss lower bound with a simple linear estimator.
Communication cost can be as low as $ ext{log}_2(rac{d(d-1)}{2}+1)$ bits.
Modified Count-Mean Sketch closely matches theoretical optimality in practice.
Abstract
This paper establishes the strict optimality in precision for frequency and distribution estimation under local differential privacy (LDP). We prove that a linear estimator with a symmetric and extremal configuration, and a constant support size equal to an optimized value, is sufficient to achieve the theoretical lower bound of the loss for both frequency and distribution estimation. The theoretical lower bound is also achieved asymptotically. Furthermore, we derive that the communication cost of such an optimal estimator can be as low as bits, where denotes the dictionary size, and propose an algorithm to generate this optimal estimator. In addition, we introduce a modified Count-Mean Sketch and demonstrate that it is practically indistinguishable from theoretical optimality with a sufficiently large dictionary size…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · Wireless Communication Security Techniques · Security in Wireless Sensor Networks
