Measurement Error in Lasso: Impact and Correction

{\O}ystein S{\o}rensen; Arnoldo Frigessi; Magne Thoresen

arXiv:1210.5378·stat.ME·January 4, 2017

Measurement Error in Lasso: Impact and Correction

{\O}ystein S{\o}rensen, Arnoldo Frigessi, Magne Thoresen

PDF

TL;DR

This paper investigates how measurement error affects lasso regression, proposes a correction method, and demonstrates improved covariate selection accuracy in high-dimensional genomic data through theoretical analysis and simulations.

Contribution

It introduces a simple correction method for measurement error in lasso, showing improved covariate selection consistency and fewer false positives in high-dimensional settings.

Findings

01

Corrected lasso achieves sign consistency under similar conditions as perfect measurement.

02

Uncorrected lasso requires stricter conditions for covariate selection.

03

Corrected lasso reduces false positives in genomic classification tasks.

Abstract

Regression with the lasso penalty is a popular tool for performing dimension reduction when the number of covariates is large. In many applications of the lasso, like in genomics, covariates are subject to measurement error. We study the impact of measurement error on linear regression with the lasso penalty, both analytically and in simulation experiments. A simple method of correction for measurement error in the lasso is then considered. In the large sample limit, the corrected lasso yields sign consistent covariate selection under conditions very similar to the lasso with perfect measurements, whereas the uncorrected lasso requires much more stringent conditions on the covariance structure of the data. Finally, we suggest methods to correct for measurement error in generalized linear models with the lasso penalty, which we study empirically in simulation experiments with logistic…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.