The Price of Tolerance in Distribution Testing
Cl\'ement L. Canonne, Ayush Jain, Gautam Kamath, Jerry Li

TL;DR
None
Contribution
None
Abstract
We revisit the problem of tolerant distribution testing. That is, given samples from an unknown distribution over , is it -close to or -far from a reference distribution (in total variation distance)? Despite significant interest over the past decade, this problem is well understood only in the extreme cases. In the noiseless setting (i.e., ) the sample complexity is , strongly sublinear in the domain size. At the other end of the spectrum, when , the sample complexity jumps to the barely sublinear . However, very little is known about the intermediate regime. We fully characterize the price of tolerance in distribution testing as a function of , , , up to a single factor. Specifically, we show the sample…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Algorithms · Complexity and Algorithms in Graphs · Adversarial Robustness in Machine Learning
