De-biased Lasso for Generalized Linear Models with A Diverging Number of   Covariates

Lu Xia; Bin Nan; Yi Li

arXiv:2110.04433·stat.ME·November 18, 2021

De-biased Lasso for Generalized Linear Models with A Diverging Number of Covariates

Lu Xia, Bin Nan, Yi Li

PDF

Open Access 1 Repo

TL;DR

This paper introduces a new de-biased lasso method for generalized linear models that effectively handles high-dimensional data with diverging covariates, providing valid confidence intervals without requiring sparsity assumptions.

Contribution

It proposes a bias-reduction approach by directly inverting the Hessian matrix, improving inference accuracy in large p, small n settings for GLMs.

Findings

01

Method achieves nominal coverage probabilities in simulations.

02

Performs well in bias reduction and confidence interval accuracy.

03

Successfully applied to lung cancer genetic data analysis.

Abstract

Modeling and drawing inference on the joint associations between single nucleotide polymorphisms and a disease has sparked interest in genome-wide associations studies. In the motivating Boston Lung Cancer Survival Cohort (BLCSC) data, the presence of a large number of single nucleotide polymorphisms of interest, though smaller than the sample size, challenges inference on their joint associations with the disease outcome. In similar settings, we find that neither the de-biased lasso approach (van de Geer et al. 2014), which assumes sparsity on the inverse information matrix, nor the standard maximum likelihood method can yield confidence intervals with satisfactory coverage probabilities for generalized linear models. Under this "large $n$ , diverging $p$ " scenario, we propose an alternative de-biased lasso approach by directly inverting the Hessian matrix without imposing the matrix…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

luxia-bios/DebiasedLassoGLMs
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStatistical Methods and Inference · Genetic Associations and Epidemiology · Advanced Causal Inference Techniques