Optimal Testing for Properties of Distributions

Jayadev Acharya; Constantinos Daskalakis; Gautam Kamath

arXiv:1507.05952·cs.DS·December 9, 2015·44 cites

Optimal Testing for Properties of Distributions

Jayadev Acharya, Constantinos Daskalakis, Gautam Kamath

PDF

Open Access

TL;DR

This paper develops sample-optimal, computationally efficient methods for testing if an unknown distribution has certain properties, like monotonicity or log-concavity, and introduces the first efficient learners for some distribution classes.

Contribution

It provides a general framework for optimal property testing of distributions and introduces the first efficient proper learners for discrete log-concave and monotone hazard rate distributions.

Findings

01

Achieved sample-optimal property testers for various distribution classes.

02

Established matching lower bounds for sample complexity and accuracy.

03

Developed the first efficient proper learners for specific distribution families.

Abstract

Given samples from an unknown distribution $p$ , is it possible to distinguish whether $p$ belongs to some class of distributions $C$ versus $p$ being far from every distribution in $C$ ? This fundamental question has received tremendous attention in statistics, focusing primarily on asymptotic analysis, and more recently in information theory and theoretical computer science, where the emphasis has been on small sample size and computational complexity. Nevertheless, even for basic properties of distributions such as monotonicity, log-concavity, unimodality, independence, and monotone-hazard rate, the optimal sample complexity is unknown. We provide a general approach via which we obtain sample-optimal and computationally efficient testers for all these distribution families. At the core of our approach is an algorithm which solves the following problem: Given…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Algorithms · Complexity and Algorithms in Graphs · Bayesian Modeling and Causal Inference