Accurate Inference for Penalized Logistic Regression

Yuming Zhang; St\'ephane Guerrier; Runze Li

arXiv:2410.20045·stat.ME·October 29, 2024

Accurate Inference for Penalized Logistic Regression

Yuming Zhang, St\'ephane Guerrier, Runze Li

PDF

Open Access

TL;DR

This paper introduces a two-step method for high-dimensional logistic regression that improves inference accuracy by reducing bias, outperforming existing approaches in finite sample scenarios.

Contribution

A novel two-step procedure combining Lasso-based variable selection and bias correction for more accurate inference in high-dimensional logistic regression.

Findings

01

Significantly smaller biases than alternative methods in finite samples

02

Improved inference performance demonstrated through numerical studies

03

Effective application to alcohol consumption data analysis

Abstract

Inference for high-dimensional logistic regression models using penalized methods has been a challenging research problem. As an illustration, a major difficulty is the significant bias of the Lasso estimator, which limits its direct application in inference. Although various bias corrected Lasso estimators have been proposed, they often still exhibit substantial biases in finite samples, undermining their inference performance. These finite sample biases become particularly problematic in one-sided inference problems, such as one-sided hypothesis testing. This paper proposes a novel two-step procedure for accurate inference in high-dimensional logistic regression models. In the first step, we propose a Lasso-based variable selection method to select a suitable submodel of moderate size for subsequent inference. In the second step, we introduce a bias corrected estimator to fit the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Statistical Methods and Models