Tight Lower Bounds for Differentially Private Selection

Thomas Steinke; Jonathan Ullman

arXiv:1704.03024·cs.DS·April 12, 2017·5 cites

Tight Lower Bounds for Differentially Private Selection

Thomas Steinke, Jonathan Ullman

PDF

Open Access

TL;DR

This paper establishes tight lower bounds on the sample complexity for differentially private selection tasks, showing that existing methods are essentially optimal and extending fingerprinting techniques to sparse settings.

Contribution

It introduces a novel extension of the fingerprinting method to derive tight lower bounds for private selection problems involving sparse query sets.

Findings

01

Lower bound of n = Ω(√k log d) samples needed for private selection.

02

Existing algorithms are near-optimal given the lower bounds.

03

Extension of fingerprinting method to sparse query scenarios.

Abstract

A pervasive task in the differential privacy literature is to select the $k$ items of "highest quality" out of a set of $d$ items, where the quality of each item depends on a sensitive dataset that must be protected. Variants of this task arise naturally in fundamental problems like feature selection and hypothesis testing, and also as subroutines for many sophisticated differentially private algorithms. The standard approaches to these tasks---repeated use of the exponential mechanism or the sparse vector technique---approximately solve this problem given a dataset of $n = O (k lo g d)$ samples. We provide a tight lower bound for some very simple variants of the private selection problem. Our lower bound shows that a sample of size $n = Ω (k lo g d)$ is required even to achieve a very minimal accuracy guarantee. Our results are based on an extension of the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsPrivacy-Preserving Technologies in Data · Cryptography and Data Security · Mobile Crowdsensing and Crowdsourcing