$\ell_p$ Testing and Learning of Discrete Distributions
Bo Waggoner

TL;DR
This paper investigates the problems of testing uniformity and learning discrete distributions under general metrics, revealing that for p > 1, these tasks can be accomplished with sample sizes independent of the support size, contrasting with the classic case.
Contribution
It introduces new sample complexity bounds for metrics, showing support-size independence for p > 1, and demonstrates that uniformity testing can be easier with larger supports under certain conditions.
Findings
Sample complexity for testing and learning is independent of support size for p > 1.
Uniformity testing complexity varies with support size depending on p, easier for larger supports if p > 4/3.
The proposed algorithms are order-optimal for all metrics studied.
Abstract
The classic problems of testing uniformity of and learning a discrete distribution, given access to independent samples from it, are examined under general metrics. The intuitions and results often contrast with the classic case. For , we can learn and test with a number of samples that is independent of the support size of the distribution: With an tolerance , samples suffice for testing uniformity and samples suffice for learning, where is the conjugate of . As this parallels the intuition that and samples suffice for the case, it seems that acts as an upper bound on the "apparent" support size. For some metrics, uniformity testing becomes easier over larger supports: a 6-sided die…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
