Nearly-Linear Time Private Hypothesis Selection with the Optimal Approximation Factor
Maryam Aliakbarpour, Zhan Shi, Ria Stevens, Vincent X. Wang

TL;DR
This paper introduces a differentially private hypothesis selection algorithm that runs in nearly-linear time, achieves the optimal approximation factor, and maintains polylogarithmic sample complexity, resolving a key open problem.
Contribution
It presents the first nearly-linear time differentially private hypothesis selection algorithm with optimal approximation factor.
Findings
Achieves optimal approximation factor of 3.
Runs in nearly-linear time with respect to the number of hypotheses.
Maintains polylogarithmic sample complexity.
Abstract
Estimating the density of a distribution from its samples is a fundamental problem in statistics. Hypothesis selection addresses the setting where, in addition to a sample set, we are given candidate distributions -- referred to as hypotheses -- and the goal is to determine which one best describes the underlying data distribution. This problem is known to be solvable very efficiently, requiring roughly samples and running in time. The quality of the output is measured via the total variation distance to the unknown distribution, and the approximation factor of the algorithm determines how large this distance is compared to the optimal distance achieved by the best candidate hypothesis. It is known that is the optimal approximation factor for this problem. We study hypothesis selection under the constraint of differential privacy. We propose a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsDistributed Sensor Networks and Detection Algorithms · Target Tracking and Data Fusion in Sensor Networks · Fault Detection and Control Systems
