On the use of cross-validation for the calibration of the adaptive lasso

Ballout Nadim; Etievant Lola; Viallon Vivian

arXiv:2005.10119·stat.ME·July 16, 2021

On the use of cross-validation for the calibration of the adaptive lasso

Ballout Nadim, Etievant Lola, Viallon Vivian

PDF

Open Access 1 Repo

TL;DR

This paper examines the effectiveness of cross-validation schemes in calibrating the adaptive lasso, demonstrating that simple schemes are suboptimal compared to more elaborate methods, impacting support recovery and prediction accuracy.

Contribution

It clarifies the appropriate cross-validation schemes for adaptive lasso calibration and highlights the shortcomings of simple schemes through empirical examples.

Findings

01

Simple cross-validation schemes are suboptimal for adaptive lasso calibration.

02

More elaborate schemes improve support recovery and prediction error.

03

The study uses synthetic and real-world data to illustrate these points.

Abstract

The adaptive lasso refers to a class of methods that use weighted versions of the $L_{1}$ -norm penalty, with weights derived from an initial estimate of the parameter vector to be estimated. Irrespective of the method chosen to compute this initial estimate, the performance of the adaptive lasso critically depends on the value of a hyperparameter, which controls the magnitude of the weighted $L_{1}$ -norm in the penalized criterion. As for other machine learning methods, cross-validation is very popular for the calibration of the adaptive lasso, that this for the selection of a data-driven optimal value of this hyperparameter. However, the most simple cross-validation scheme is not valid in this context, and a more elaborate one has to be employed to guarantee an optimal calibration. The discrepancy of the simple cross-validation scheme has been well documented in other contexts, but less so…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

NadimBLT/AdapGlmnet-NestedCv
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStatistical Methods and Inference · Advanced Statistical Methods and Models · Machine Learning and Data Classification