Selection of variables and dimension reduction in high-dimensional non-parametric regression

Karine Bertin; Guillaume Lecué

arXiv:0811.1115·math.ST·December 16, 2008

TL;DR

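This paper introduces an $l_1$-penalization method for variable selection and dimension reduction in high-dimensional non-parametric Gaussian regression. When the regression function depends on only a few coordinates of the input, the method attains the faster estimation rate governed by the number of relevant variables rather than by the ambient dimension.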

Contribution

It develops a two-step procedure combining coordinate selection with local polynomial estimation to adaptively reduce dimension and improve estimation rates.
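The two-step idea can be illustrated with a minimal sketch. This is not the authors' exact estimator: it assumes a box kernel, a local-linear lasso solved by proximal gradient (ISTA) as the selection step, and an unpenalized local-linear fit as the second step. The function names, the simulated regression function, and the values of `h` and `lam` are all hypothetical choices made for illustration.

```python
import numpy as np

def soft_threshold(v, t):
    """Proximal operator of t * ||.||_1."""
    return np.sign(v) * np.maximum(np.abs(v) - t, 0.0)

def select_coordinates(X, y, x0, h, lam, n_iter=500):
    """Step 1 (illustrative): l1-penalized local-linear fit around x0.

    Returns the indices whose local slope is not shrunk to zero.
    """
    in_window = np.max(np.abs(X - x0), axis=1) <= h  # box kernel
    Z = (X[in_window] - x0) / h
    r = y[in_window]
    Z = Z - Z.mean(axis=0)   # centering removes the intercept
    r = r - r.mean()
    m = Z.shape[0]
    L = np.linalg.eigvalsh(Z.T @ Z / m)[-1]  # Lipschitz constant of the gradient
    theta = np.zeros(Z.shape[1])
    for _ in range(n_iter):
        grad = Z.T @ (Z @ theta - r) / m
        theta = soft_threshold(theta - grad / L, lam / L)
    return np.flatnonzero(theta != 0.0)

def local_linear_estimate(X, y, x0, h):
    """Step 2 (illustrative): local-linear estimate of f(x0) on the given columns."""
    in_window = np.max(np.abs(X - x0), axis=1) <= h
    design = np.column_stack([np.ones(in_window.sum()), X[in_window] - x0])
    coef, *_ = np.linalg.lstsq(design, y[in_window], rcond=None)
    return coef[0]  # the intercept is the fitted value at x0

# Simulated data: d = 6 inputs, but f depends only on coordinates 0 and 1.
rng = np.random.default_rng(0)
n, d = 5000, 6
X = rng.uniform(-1.0, 1.0, size=(n, d))
y = 2.0 * X[:, 0] + np.sin(np.pi * X[:, 1]) + 0.5 * rng.standard_normal(n)

x0 = np.zeros(d)  # point at which f is estimated; the true value is f(x0) = 0
selected = select_coordinates(X, y, x0, h=0.8, lam=0.1)
f_hat = local_linear_estimate(X[:, selected], y, x0[selected], h=0.3)
```

The second step can afford a much smaller bandwidth than the first because its window lives in the selected coordinates only, so it still contains enough observations. That is the mechanism behind the improved rate.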

Findings

01

Successfully selects relevant variables with high probability.

02

Achieves estimation at the rate $n^{-2\beta/(2\beta+d^*)}$ using the reduced dimension.

03

Demonstrates the effectiveness of $l_1$ penalization in non-parametric regression.
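To get a feel for how much the reduced dimension matters, the short computation below compares the rate $n^{-2\beta/(2\beta+d)}$ in the ambient dimension with the same rate in the reduced dimension $d^*$. The values of `n`, `beta`, `dim=50` and `dim=3` are hypothetical and chosen only for illustration; constants and logarithmic factors are ignored.

```python
def nonparametric_rate(n, beta, dim):
    """Rate n^(-2*beta / (2*beta + dim)) for a beta-regular function in `dim` variables."""
    return n ** (-2.0 * beta / (2.0 * beta + dim))

n, beta = 10_000, 2.0
rate_full = nonparametric_rate(n, beta, dim=50)    # ambient dimension d
rate_reduced = nonparametric_rate(n, beta, dim=3)  # reduced dimension d*
```

With these illustrative values the full-dimension rate is roughly 0.5, while the reduced-dimension rate is roughly 0.005: in dimension 50, ten thousand observations barely reduce the error scale at all.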

Abstract

We consider an $l_{1}$-penalization procedure in the non-parametric Gaussian regression model. In many concrete examples, the dimension $d$ of the input variable $X$ is very large (sometimes depending on the number of observations). Estimation of a $\beta$-regular regression function $f$ cannot be faster than the slow rate $n^{-2\beta/(2\beta+d)}$. Fortunately, in some situations, $f$ depends on only a few of the coordinates of $X$. In this paper, we construct two procedures. The first one selects these coordinates with high probability. Then, using this subset selection method, we run a local polynomial estimator (on the set of relevant coordinates) to estimate the regression function at the rate $n^{-2\beta/(2\beta+d^{*})}$, where $d^{*}$, the "real" dimension of the problem (the exact number of variables on which $f$ depends), has replaced the dimension $d$ of the design. To achieve…
