Tight Data Access Bounds for Private Top-$k$ Selection
Hao Wu, Olga Ohrimenko, Anthony Wirth

TL;DR
This paper introduces a new algorithm for private top-$k$ selection that significantly reduces data access costs under differential privacy, achieving near-optimal bounds with innovative analysis and lower bounds.
Contribution
It presents the first sublinear data-access upper bound for private top-$k$ selection and establishes fundamental lower bounds for different access models.
Findings
Algorithm requires only $O(\sqrt{mk})$ expected accesses.
Exponential mechanism needs only $O(\sqrt{m})$ expected accesses.
Supporting both random and sorted accesses is necessary to avoid linear costs.
Abstract
We study the top- selection problem under the differential privacy model: items are rated according to votes of a set of clients. We consider a setting in which algorithms can retrieve data via a sequence of accesses, each either a random access or a sorted access; the goal is to minimize the total number of data accesses. Our algorithm requires only expected accesses: to our knowledge, this is the first sublinear data-access upper bound for this problem. Our analysis also shows that the well-known exponential mechanism requires only expected accesses. Accompanying this, we develop the first lower bounds for the problem, in three settings: only random accesses; only sorted accesses; a sequence of accesses of either kind. We show that, to avoid access cost, supporting *both* kinds of access is necessary, and that in this case our…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · Cryptography and Data Security · Mobile Crowdsensing and Crowdsourcing
