Modified Cross-Validation for Penalized High-Dimensional Linear   Regression Models

Yi Yu; Yang Feng

arXiv:1309.2068·stat.ME·September 10, 2013

Modified Cross-Validation for Penalized High-Dimensional Linear Regression Models

Yi Yu, Yang Feng

PDF

Open Access

TL;DR

This paper introduces a modified cross-validation method for penalized high-dimensional linear regression models, improving variable selection accuracy over traditional methods in various settings.

Contribution

It proposes a new cross-validation approach tailored for Lasso and Elastic Net models, enhancing penalty parameter selection in high-dimensional contexts.

Findings

01

Modified CV reduces noise variable inclusion

02

Performs well across diverse coefficient and correlation settings

03

Outperforms standard K-fold CV in simulations and real data

Abstract

In this paper, for Lasso penalized linear regression models in high-dimensional settings, we propose a modified cross-validation method for selecting the penalty parameter. The methodology is extended to other penalties, such as Elastic Net. We conduct extensive simulation studies and real data analysis to compare the performance of the modified cross-validation method with other methods. It is shown that the popular $K$ -fold cross-validation method includes many noise variables in the selected model, while the modified cross-validation works well in a wide range of coefficient and correlation settings. Supplemental materials containing the computer code are available online.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStatistical Methods and Inference · Advanced Statistical Methods and Models · Statistical Methods and Bayesian Inference