Improved Inference for the Signal Significance
Igor Volobouev, A. Alexandre Trindade

TL;DR
This paper analyzes likelihood-based statistics for signal detection, showing that third-order Edgeworth approximations effectively control false positive rates at low sample sizes, with likelihood ratio tests outperforming others.
Contribution
It introduces third-order Edgeworth expansion methods to improve p-value accuracy and demonstrates the superior performance of likelihood ratio tests in low-sample scenarios.
Findings
Likelihood ratio tests outperform other statistics in false negative rates.
Third-order Edgeworth approximations ensure accurate p-values at low sample sizes.
Likelihood ratio maintains better adherence to normality assumptions.
Abstract
We study the properties of several likelihood-based statistics commonly used in testing for the presence of a known signal under a mixture model with known background, but unknown signal fraction. Under the null hypothesis of no signal, all statistics follow a standard normal distribution in large samples, but substantial deviations can occur at low sample sizes. Approximations for respective -values are derived to various orders of accuracy using the methodology of Edgeworth expansions. Adherence to normality is studied, and the magnitude of deviations is quantified according to resulting inflation or deflation. We find that approximations to third-order accuracy are generally sufficient to guarantee -values with nominal false positive error rates in the five sigma range (-value ) for the classic Wald, score, and likelihood ratio (LR) statistics at…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
