Which Distribution Distances are Sublinearly Testable?
Constantinos Daskalakis, Gautam Kamath, John Wright

TL;DR
This paper characterizes the complexity of testing whether two distributions are close or far under various distances, identifying which problems admit sublinear algorithms and providing matching bounds, thus advancing understanding of distribution testing complexities.
Contribution
It provides a comprehensive analysis of the complexity of distribution testing across multiple distances, including new bounds for sublinear testers and differences between identity and equivalence testing.
Findings
Identified which distribution testing problems are sublinearly testable.
Provided matching upper and lower bounds for various distance pairs.
Highlighted differences between identity and equivalence testing in terms of tolerance.
Abstract
Given samples from an unknown distribution and a description of a distribution , are and close or far? This question of "identity testing" has received significant attention in the case of testing whether and are equal or far in total variation distance. However, in recent work, the following questions have been been critical to solving problems at the frontiers of distribution testing: -Alternative Distances: Can we test whether and are far in other distances, say Hellinger? -Tolerance: Can we test when and are close, rather than equal? And if so, close in which distances? Motivated by these questions, we characterize the complexity of distribution testing under a variety of distances, including total variation, , Hellinger, Kullback-Leibler, and . For each pair of distances and , we study the complexity of testing…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
