High-dimensional Gaussian model selection on a Gaussian design
Nicolas Verzelen (LM-Orsay, INRIA Saclay - Ile de France)

TL;DR
This paper introduces a flexible model selection method for high-dimensional Gaussian regression that achieves optimal rates and adapts to various settings, including when the number of variables exceeds observations.
Contribution
It proposes a penalized least-squares procedure for Gaussian model selection that handles high-dimensional data, incorporates prior knowledge, and provides non-asymptotic guarantees.
Findings
Achieves non-asymptotic oracle inequalities.
Handles p > n scenarios effectively.
Demonstrates adaptiveness and minimax optimality.
Abstract
We consider the problem of estimating the conditional mean of a real Gaussian variable where the vector of the covariates follows a joint Gaussian distribution. This issue often occurs when one aims at estimating the graph or the distribution of a Gaussian graphical model. We introduce a general model selection procedure which is based on the minimization of a penalized least-squares type criterion. It handles a variety of problems such as ordered and complete variable selection, allows to incorporate some prior knowledge on the model and applies when the number of covariates is larger than the number of observations . Moreover, it is shown to achieve a non-asymptotic oracle inequality independently of the correlation structure of the covariates. We also exhibit various minimax…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
