Global testing under sparse alternatives: ANOVA, multiple comparisons and the higher criticism
Ery Arias-Castro, Emmanuel J. Cand\`es, Yaniv Plan

TL;DR
This paper investigates the effectiveness of ANOVA, multiple comparisons, and higher criticism methods for detecting sparse signals in high-dimensional linear models, establishing their optimality under various sparsity regimes.
Contribution
It characterizes the optimality of classical and modern testing procedures across different sparsity levels, highlighting the superiority of higher criticism in moderate to strong sparsity.
Findings
ANOVA is optimal under moderate sparsity ($0 extless\alpha extlessrac{1}{2}$) with certain design conditions.
Multiple comparison procedures are optimal when $ ext{ extgreater}3/4$ sparsity.
Higher criticism is powerful and optimal for the entire range $ extgreater 1/2$ sparsity.
Abstract
Testing for the significance of a subset of regression coefficients in a linear model, a staple of statistical analysis, goes back at least to the work of Fisher who introduced the analysis of variance (ANOVA). We study this problem under the assumption that the coefficient vector is sparse, a common situation in modern high-dimensional settings. Suppose we have covariates and that under the alternative, the response only depends upon the order of of those, . Under moderate sparsity levels, that is, , we show that ANOVA is essentially optimal under some conditions on the design. This is no longer the case under strong sparsity constraints, that is, . In such settings, a multiple comparison procedure is often preferred and we establish its optimality when . However, these two very popular methods are…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
