Testing Imprecise Hypotheses
Lucas Kania, Tudor Manole, Larry Wasserman, Sivaraman Balakrishnan

TL;DR
This paper explores the theoretical limits of tolerant hypothesis testing, analyzing the trade-offs between neighborhood size and test power across various models, and critiques classical methods like chi-squared tests.
Contribution
It characterizes the optimal information-theoretic trade-offs in tolerant testing for Gaussian, regression, and density models, and evaluates classical test sub-optimality.
Findings
Optimal trade-offs characterized for Gaussian sequence model.
Classical chi-squared test found sub-optimal for tolerant testing.
Proposes simple alternative hypothesis tests.
Abstract
Many scientific applications involve testing theories that are only partially specified. This task often amounts to testing the goodness-of-fit of a candidate distribution while allowing for reasonable deviations from it. The tolerant testing framework provides a systematic way of constructing such tests. Rather than testing the simple null hypothesis that data was drawn from a candidate distribution, a tolerant test assesses whether the data is consistent with any distribution that lies within a given neighborhood of the candidate. As this neighborhood grows, the tolerance to misspecification increases, while the power of the test decreases. In this work, we characterize the information-theoretic trade-off between the size of the neighborhood and the power of the test, in several canonical models. On the one hand, we characterize the optimal trade-off for tolerant testing in the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Methods and Inference · Statistical Methods in Clinical Trials · Bayesian Methods and Mixture Models
