A Simple Correction Procedure for High-Dimensional Generalized Linear Models with Measurement Error
Michael Byrd, Monnie McGee

TL;DR
This paper introduces a simple correction procedure for high-dimensional generalized linear models with measurement error, enabling more accurate estimation and variable selection in contaminated data scenarios.
Contribution
It proposes a novel correction method using the Imputation-Regularized Optimization algorithm that simplifies high-dimensional errors-in-variables modeling.
Findings
Reduces false positives in gene microarray data analysis
Retains most true positives after correction
Allows use of standard linear regression methods with contaminated covariates
Abstract
We consider high-dimensional generalized linear models when the covariates are contaminated by measurement error. Estimates from errors-in-variables regression models are well-known to be biased in traditional low-dimensional settings if the error is unincorporated. Such models have recently become of interest when regularizing penalties are added to the estimation procedure. Unfortunately, correcting for the mismeasurements can add undue computational difficulties onto the optimization, which a new tool set for practitioners to successfully use the models. We investigate a general procedure that utilizes the recently proposed Imputation-Regularized Optimization algorithm for high-dimensional errors-in-variables models, which we implement for continuous, binary, and count response type. Crucially, our method allows for off-the-shelf linear regression methods to be employed in the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Methods and Inference · Optimal Experimental Design Methods · Gene expression and cancer classification
