The out-of-sample prediction error of the square-root-LASSO and related   estimators

Jos\'e Luis Montiel Olea; Cynthia Rush; Amilcar Velez; Johannes Wiesel

arXiv:2211.07608·math.ST·April 10, 2024

The out-of-sample prediction error of the square-root-LASSO and related estimators

Jos\'e Luis Montiel Olea, Cynthia Rush, Amilcar Velez, Johannes Wiesel

PDF

Open Access

TL;DR

This paper analyzes the out-of-sample prediction error of the square-root LASSO and similar estimators, providing new theoretical insights, distributionally robust interpretations, and practical guidelines for regularization and model comparison.

Contribution

It introduces conditions linking these estimators to distributionally robust optimization, offers finite-sample and asymptotic analysis, and proposes methods for regularization tuning and estimator ranking without sparsity assumptions.

Findings

01

Linear predictors minimize worst-case prediction error over Wasserstein-like distributional neighborhoods.

02

Provides finite-sample and asymptotic bounds for distributionally robust prediction error.

03

Offers practical procedures for regularization parameter selection and estimator comparison.

Abstract

We study the classical problem of predicting an outcome variable, $Y$ , using a linear combination of a $d$ -dimensional covariate vector, $X$ . We are interested in linear predictors whose coefficients solve: % \begin{align*} \inf_{\boldsymbol{\beta} \in \mathbb{R}^d} \left( \mathbb{E}_{\mathbb{P}_n} \left[ \left(Y-\mathbf{X}^{\top}\beta \right)^r \right] \right)^{1/r} +\delta \, \rho\left(\boldsymbol{\beta}\right), \end{align*} where $δ > 0$ is a regularization parameter, $ρ : R^{d} \to R_{+}$ is a convex penalty function, $P_{n}$ is the empirical distribution of the data, and $r \geq 1$ . We present three sets of new results. First, we provide conditions under which linear predictors based on these estimators % solve a \emph{distributionally robust optimization} problem: they minimize the worst-case prediction error over distributions that are close to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRisk and Portfolio Optimization