Optimality of the max test for detecting sparse signals with Gaussian or   heavier tail

Xiao Li; William Fithian

arXiv:2006.12489·math.ST·June 24, 2020·1 cites

Optimality of the max test for detecting sparse signals with Gaussian or heavier tail

Xiao Li, William Fithian

PDF

Open Access

TL;DR

This paper proves the optimality of the max test for detecting sparse signals in high-dimensional data with Gaussian or heavier tails, extending previous results to more general non-null mean distributions.

Contribution

It establishes the max test's optimal detection boundary in the weak, sparse regime for distributions with tails no lighter than Gaussian, and analyzes the limitations of the higher criticism test.

Findings

01

Max test is optimal in the weak, sparse regime for Gaussian or heavier tails.

02

Higher criticism can have very low power with polynomial tail distributions.

03

Theoretical and simulation results confirm the detection boundaries.

Abstract

A fundamental problem in high-dimensional testing is that of global null testing: testing whether the null holds simultaneously in all of $n$ hypotheses. The max test, which uses the smallest of the $n$ marginal p-values as its test statistic, enjoys widespread popularity for its simplicity and robustness. However, its theoretical performance relative to other tests has been called into question. In the Gaussian sequence version of the global testing problem, Donoho and Jin (2004) discovered a so-called "weak, sparse" asymptotic regime in which the higher criticism and Berk-Jones tests achieve a better detection boundary than the max test when all of the nonzero signal strengths are identical. We study a more general model in which the non-null means are drawn from a generic distribution, and show that the detection boundary for the max test is optimal in the "weak, sparse" regime,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStatistical Methods and Inference · Statistical Methods in Clinical Trials · Statistical Methods and Bayesian Inference