Variable Selection for High-dimensional Generalized Linear Models using an Iterated Conditional Modes/Medians Algorithm
Vitara Pungpapong, Min Zhang, Dabao Zhang

TL;DR
This paper introduces an extension of the ICM/M algorithm for variable selection in high-dimensional generalized linear models, effectively combining empirical Bayes methods with iterative optimization for genomic data analysis.
Contribution
It develops a novel ICM/M algorithm tailored for high-dimensional GLMs, incorporating spike-and-slab priors and network information, with implementation in an accessible R package.
Findings
Demonstrates superior variable selection accuracy in simulations.
Effectively models binary logistic and Cox proportional hazards data.
Provides a practical R package for implementation.
Abstract
High-dimensional linear and nonlinear models have been extensively used to identify associations between response and explanatory variables. The variable selection problem is commonly of interest in the presence of massive and complex data. An empirical Bayes model for high-dimensional generalized linear models (GLMs) is considered in this paper. The extension of the Iterated Conditional Modes/Medians (ICM/M) algorithm is proposed to build up a GLM. With the construction of pseudodata and pseudovariances based on iteratively reweighted least squares (IRLS), conditional modes are employed to obtain data-drive optimal values for hyperparameters and conditional medians are used to estimate regression coefficients. With a spike-and-slab prior for each coefficient, a conditional median can enforce variable estimation and selection at the same time. The ICM/M algorithm can also incorporate…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGene expression and cancer classification · Gene Regulatory Network Analysis · Genetic Mapping and Diversity in Plants and Animals
