Robust Variable Selection for High-dimensional Regression with Missing Data and Measurement Errors

Zhenhao Zhang; Yunquan Song

arXiv:2410.16722·stat.ME·July 1, 2025

Robust Variable Selection for High-dimensional Regression with Missing Data and Measurement Errors

Zhenhao Zhang, Yunquan Song

PDF

Open Access

TL;DR

This paper introduces a robust variable selection method for high-dimensional regression with missing data and measurement errors, using an exponential loss function and inverse probability weighting to improve accuracy and robustness.

Contribution

It proposes a novel exponential loss function with a tuning parameter and the Atan penalty, enhancing robustness in variable selection under data imperfections.

Findings

01

The method improves variable selection accuracy in simulations.

02

It performs well on real breast cancer data.

03

The Atan penalty outperforms traditional penalties.

Abstract

In our paper, we focus on robust variable selection for missing data and measurement error. Missing data and measurement errors can lead to confusing data distribution. We propose an exponential loss function with a tuning parameter to apply to Missing and measurement errors data. By adjusting the parameter, the loss function can be better and more robust under various data distributions. We use inverse probability weighting and additive error models to address missing data and measurement errors. Also, we find that the Atan punishment method works better. We used Monte Carlo simulations to assess the validity of robust variable selection and validated our findings with the breast cancer dataset.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsFace and Expression Recognition · Advanced Statistical Methods and Models · Statistical Methods and Inference

MethodsFocus