Discrete Distribution Estimation under User-level Local Differential   Privacy

Jayadev Acharya; Yuhan Liu; Ziteng Sun

arXiv:2211.03757·cs.LG·November 8, 2022·1 cites

Discrete Distribution Estimation under User-level Local Differential Privacy

Jayadev Acharya, Yuhan Liu, Ziteng Sun

PDF

Open Access

TL;DR

This paper investigates the problem of estimating discrete distributions under user-level local differential privacy, revealing phase transitions and equivalences that inform privacy-utility trade-offs and connecting to shuffled differential privacy.

Contribution

It provides tight bounds for user-level LDP distribution estimation, demonstrating equivalences between multiple samples per user and more users, and links to shuffled DP for improved guarantees.

Findings

01

More samples per user can be simulated by more users with fewer samples each.

02

Phase transitions depend on the number of samples, privacy level, and estimation risk.

03

Algorithms achieve near-optimal error bounds, verified by simulations.

Abstract

We study discrete distribution estimation under user-level local differential privacy (LDP). In user-level $ε$ -LDP, each user has $m \geq 1$ samples and the privacy of all $m$ samples must be preserved simultaneously. We resolve the following dilemma: While on the one hand having more samples per user should provide more information about the underlying distribution, on the other hand, guaranteeing the privacy of all $m$ samples should make the estimation task more difficult. We obtain tight bounds for this problem under almost all parameter regimes. Perhaps surprisingly, we show that in suitable parameter regimes, having $m$ samples per user is equivalent to having $m$ times more users, each with only one sample. Our results demonstrate interesting phase transitions for $m$ and the privacy parameter $ε$ in the estimation risk. Finally, connecting with recent results on…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsPrivacy-Preserving Technologies in Data · Probability and Risk Models · Vehicular Ad Hoc Networks (VANETs)