Generalized Error Exponents For Small Sample Universal Hypothesis Testing
Dayu Huang, Sean Meyn

TL;DR
This paper introduces a new large deviations-based performance criterion for small sample universal hypothesis testing in high-dimensional settings where the sample size is smaller than the number of outcomes, providing insights beyond classical methods.
Contribution
It proposes a generalized error exponent criterion for analyzing tests in small sample, high-dimensional regimes and identifies optimal tests achieving this criterion.
Findings
Optimal error probability decays as exp{-(n^2/m)J} for some J>0
Separable statistic-based tests, including coincidence-based, are optimal under the new criterion
Pearson's chi-square test has zero error exponent, performing worse than optimal tests
Abstract
The small sample universal hypothesis testing problem is investigated in this paper, in which the number of samples is smaller than the number of possible outcomes . The goal of this work is to find an appropriate criterion to analyze statistical tests in this setting. A suitable model for analysis is the high-dimensional model in which both and increase to infinity, and . A new performance criterion based on large deviations analysis is proposed and it generalizes the classical error exponent applicable for large sample problems (in which ). This generalized error exponent criterion provides insights that are not available from asymptotic consistency or central limit theorem analysis. The following results are established for the uniform null distribution: (i) The best achievable probability of error decays as for…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
