A semi-automatic method to guide the choice of ridge parameter in ridge   regression

Erika Cule; Maria De Iorio

arXiv:1205.0686·stat.AP·May 4, 2012·34 cites

A semi-automatic method to guide the choice of ridge parameter in ridge regression

Erika Cule, Maria De Iorio

PDF

Open Access

TL;DR

This paper introduces a semi-automatic method for selecting the ridge parameter in ridge regression, specifically tailored for high-dimensional genomic data, to improve out-of-sample prediction accuracy.

Contribution

The authors propose a variance-based ridge parameter selection method that enhances prediction performance in high-dimensional settings, extending to binary outcomes and real genomic data.

Findings

01

Outperforms subset selection and HyperLasso in simulated data

02

Improves prediction error in high-dimensional genomic data

03

Effective for binary case-control outcomes

Abstract

We consider the application of a popular penalised regression method, Ridge Regression, to data with very high dimensions and many more covariates than observations. Our motivation is the problem of out-of-sample prediction and the setting is high-density genotype data from a genome-wide association or resequencing study. Ridge regression has previously been shown to offer improved performance for prediction when compared with other penalised regression methods. One problem with ridge regression is the choice of an appropriate parameter for controlling the amount of shrinkage of the coefficient estimates. Here we propose a method for choosing the ridge parameter based on controlling the variance of the predicted observations in the model. Using simulated data, we demonstrate that our method outperforms subset selection based on univariate tests of association and another penalised…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSoil Geostatistics and Mapping · Mineral Processing and Grinding