Covariate assisted screening and estimation
Zheng Tracy Ke, Jiashun Jin, Jianqing Fan

TL;DR
This paper introduces CASE, a new two-stage variable selection method for high-dimensional linear models with sparse signals, leveraging linear filtering and graph-guided screening to improve detection of weak, rare signals.
Contribution
The paper proposes the covariate assisted screening and estimation (CASE) procedure, combining linear filtering, graph-based screening, and patching to effectively identify sparse signals in complex, nonsparse Gram matrix settings.
Findings
CASE effectively detects weak, rare signals.
The method decomposes large problems into manageable subproblems.
Patching overcomes information leakage issues.
Abstract
Consider a linear model , where and . The vector is unknown but is sparse in the sense that most of its coordinates are . The main interest is to separate its nonzero coordinates from the zero ones (i.e., variable selection). Motivated by examples in long-memory time series (Fan and Yao [Nonlinear Time Series: Nonparametric and Parametric Methods (2003) Springer]) and the change-point problem (Bhattacharya [In Change-Point Problems (South Hadley, MA, 1992) (1994) 28-56 IMS]), we are primarily interested in the case where the Gram matrix is nonsparse but sparsifiable by a finite order linear filter. We focus on the regime where signals are both rare and weak so that successful variable selection is very challenging but is still possible. We approach this problem by a new procedure called the covariate assisted screening and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
