TL;DR
This paper investigates the limits of differentially private algorithms for counting distinct elements in distributed settings, establishing lower bounds and proposing protocols with optimal error bounds across various privacy models.
Contribution
It introduces new lower bounds for error in local and shuffle models and presents a protocol with near-optimal error in the multi-message shuffle setting, advancing understanding of privacy-error trade-offs.
Findings
Non-interactive local setting error is at least linear in n.
Single-message shuffle setting also has linear error lower bound.
Multi-message shuffle protocol achieves 7 error of , matching known lower bounds.
Abstract
We study the setup where each of users holds an element from a discrete set, and the goal is to count the number of distinct elements across all users, under the constraint of -differentially privacy: - In the non-interactive local setting, we prove that the additive error of any protocol is for any constant and for any inverse polynomial in . - In the single-message shuffle setting, we prove a lower bound of on the error for any constant and for some inverse quasi-polynomial in . We do so by building on the moment-matching method from the literature on distribution estimation. - In the multi-message shuffle setting, we give a protocol with at most one message per user in expectation and with an error of for any constant and for any inverse…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
On Distributed Differential Privacy and Counting Distinct Elements· youtube
