UPS delivers optimal phase diagram in high-dimensional variable   selection

Pengsheng Ji; Jiashun Jin

arXiv:1010.5028·math.ST·May 29, 2012

UPS delivers optimal phase diagram in high-dimensional variable selection

Pengsheng Ji, Jiashun Jin

PDF

TL;DR

The paper introduces UPS, a variable selection method for high-dimensional linear models with unknown sparse correlation structures, combining univariate screening and penalized MLE for effective and computationally feasible identification of relevant variables.

Contribution

It proposes the UPS method that achieves sure screening and separability, enabling efficient variable selection in high-dimensional, correlated data settings, with theoretical guarantees.

Findings

01

UPS achieves accurate variable selection in simulations.

02

The method is computationally efficient for large p and n.

03

Theoretical analysis confirms its effectiveness under sparsity.

Abstract

Consider a linear model $Y = X β + z$ , $z \sim N (0, I_{n})$ . Here, $X = X_{n, p}$ , where both $p$ and $n$ are large, but $p > n$ . We model the rows of $X$ as i.i.d. samples from $N (0, \frac{1}{n} Ω)$ , where $Ω$ is a $p \times p$ correlation matrix, which is unknown to us but is presumably sparse. The vector $β$ is also unknown but has relatively few nonzero coordinates, and we are interested in identifying these nonzeros. We propose the Univariate Penalization Screeing (UPS) for variable selection. This is a screen and clean method where we screen with univariate thresholding and clean with penalized MLE. It has two important properties: sure screening and separable after screening. These properties enable us to reduce the original regression problem to many small-size regression problems that can be fitted separately. The UPS is effective both in theory and in computation.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.