UPS delivers optimal phase diagram in high-dimensional variable selection
Pengsheng Ji, Jiashun Jin

TL;DR
The paper introduces UPS, a variable selection method for high-dimensional linear models with unknown sparse correlation structures, combining univariate screening and penalized MLE for effective and computationally feasible identification of relevant variables.
Contribution
It proposes the UPS method that achieves sure screening and separability, enabling efficient variable selection in high-dimensional, correlated data settings, with theoretical guarantees.
Findings
UPS achieves accurate variable selection in simulations.
The method is computationally efficient for large p and n.
Theoretical analysis confirms its effectiveness under sparsity.
Abstract
Consider a linear model , . Here, , where both and are large, but . We model the rows of as i.i.d. samples from , where is a correlation matrix, which is unknown to us but is presumably sparse. The vector is also unknown but has relatively few nonzero coordinates, and we are interested in identifying these nonzeros. We propose the Univariate Penalization Screeing (UPS) for variable selection. This is a screen and clean method where we screen with univariate thresholding and clean with penalized MLE. It has two important properties: sure screening and separable after screening. These properties enable us to reduce the original regression problem to many small-size regression problems that can be fitted separately. The UPS is effective both in theory and in computation.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
