Discrete Distribution Estimation under User-level Local Differential Privacy
Jayadev Acharya, Yuhan Liu, Ziteng Sun

TL;DR
This paper investigates the problem of estimating discrete distributions under user-level local differential privacy, revealing phase transitions and equivalences that inform privacy-utility trade-offs and connecting to shuffled differential privacy.
Contribution
It provides tight bounds for user-level LDP distribution estimation, demonstrating equivalences between multiple samples per user and more users, and links to shuffled DP for improved guarantees.
Findings
More samples per user can be simulated by more users with fewer samples each.
Phase transitions depend on the number of samples, privacy level, and estimation risk.
Algorithms achieve near-optimal error bounds, verified by simulations.
Abstract
We study discrete distribution estimation under user-level local differential privacy (LDP). In user-level -LDP, each user has samples and the privacy of all samples must be preserved simultaneously. We resolve the following dilemma: While on the one hand having more samples per user should provide more information about the underlying distribution, on the other hand, guaranteeing the privacy of all samples should make the estimation task more difficult. We obtain tight bounds for this problem under almost all parameter regimes. Perhaps surprisingly, we show that in suitable parameter regimes, having samples per user is equivalent to having times more users, each with only one sample. Our results demonstrate interesting phase transitions for and the privacy parameter in the estimation risk. Finally, connecting with recent results on…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · Probability and Risk Models · Vehicular Ad Hoc Networks (VANETs)
