De-biased Lasso for Generalized Linear Models with A Diverging Number of Covariates
Lu Xia, Bin Nan, Yi Li

TL;DR
This paper introduces a new de-biased lasso method for generalized linear models that effectively handles high-dimensional data with diverging covariates, providing valid confidence intervals without requiring sparsity assumptions.
Contribution
It proposes a bias-reduction approach by directly inverting the Hessian matrix, improving inference accuracy in large p, small n settings for GLMs.
Findings
Method achieves nominal coverage probabilities in simulations.
Performs well in bias reduction and confidence interval accuracy.
Successfully applied to lung cancer genetic data analysis.
Abstract
Modeling and drawing inference on the joint associations between single nucleotide polymorphisms and a disease has sparked interest in genome-wide associations studies. In the motivating Boston Lung Cancer Survival Cohort (BLCSC) data, the presence of a large number of single nucleotide polymorphisms of interest, though smaller than the sample size, challenges inference on their joint associations with the disease outcome. In similar settings, we find that neither the de-biased lasso approach (van de Geer et al. 2014), which assumes sparsity on the inverse information matrix, nor the standard maximum likelihood method can yield confidence intervals with satisfactory coverage probabilities for generalized linear models. Under this "large , diverging " scenario, we propose an alternative de-biased lasso approach by directly inverting the Hessian matrix without imposing the matrix…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Methods and Inference · Genetic Associations and Epidemiology · Advanced Causal Inference Techniques
