Second-Order Asymptotics of Two-Sample Tests
K V Harsha, Jithin Ravi, Tobias Koch

TL;DR
This paper analyzes the second-order asymptotics of two-sample tests, generalizing the Gutman test to arbitrary divergences, and establishes optimality and connections to robust goodness-of-fit testing.
Contribution
It introduces a divergence test generalization of the Gutman test, proving its optimal first-order and second-order asymptotic properties and linking it to the GLRT.
Findings
Divergence test achieves optimal first-order exponent.
Second-order asymptotics match those of the Gutman test.
Gutman test is shown to be the GLRT for two-sample testing.
Abstract
In two-sampling testing, one observes two independent sequences of independent and identically distributed random variables distributed according to the distributions and and wishes to decide whether (null hypothesis) or (alternative hypothesis). The Gutman test for this problem compares the empirical distributions of the observed sequences and decides on the null hypothesis if the Jensen-Shannon (JS) divergence between these empirical distributions is below a given threshold. This paper proposes a generalization of the Gutman test, termed \emph{divergence test}, which replaces the JS divergence by an arbitrary divergence. For this test, the exponential decay of the type-II error probability for a fixed type-I error probability is studied. First, it is shown that the divergence test achieves the optimal first-order exponent, irrespective of the choice…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRandom Matrices and Applications · Distributed Sensor Networks and Detection Algorithms · Statistical Distribution Estimation and Applications
