Optimal Testing for Properties of Distributions
Jayadev Acharya, Constantinos Daskalakis, Gautam Kamath

TL;DR
This paper develops sample-optimal, computationally efficient methods for testing if an unknown distribution has certain properties, like monotonicity or log-concavity, and introduces the first efficient learners for some distribution classes.
Contribution
It provides a general framework for optimal property testing of distributions and introduces the first efficient proper learners for discrete log-concave and monotone hazard rate distributions.
Findings
Achieved sample-optimal property testers for various distribution classes.
Established matching lower bounds for sample complexity and accuracy.
Developed the first efficient proper learners for specific distribution families.
Abstract
Given samples from an unknown distribution , is it possible to distinguish whether belongs to some class of distributions versus being far from every distribution in ? This fundamental question has received tremendous attention in statistics, focusing primarily on asymptotic analysis, and more recently in information theory and theoretical computer science, where the emphasis has been on small sample size and computational complexity. Nevertheless, even for basic properties of distributions such as monotonicity, log-concavity, unimodality, independence, and monotone-hazard rate, the optimal sample complexity is unknown. We provide a general approach via which we obtain sample-optimal and computationally efficient testers for all these distribution families. At the core of our approach is an algorithm which solves the following problem: Given…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Algorithms · Complexity and Algorithms in Graphs · Bayesian Modeling and Causal Inference
