TL;DR
This paper introduces a fully Bayesian MCMC method for high-dimensional logistic regression using hyper-LASSO priors, effectively handling multi-modal posteriors for feature selection in complex datasets.
Contribution
It develops a novel Hamiltonian Monte Carlo-based Bayesian approach for hyper-LASSO penalized logistic regression, addressing a gap in fully Bayesian high-dimensional feature selection methods.
Findings
Hyper-LASSO priors outperform traditional penalties in simulations.
The method effectively identifies relevant features in real genomic data.
Hyper-LASSO prior parameters influence model sparsity and performance.
Abstract
High-dimensional feature selection arises in many areas of modern science. For example, in genomic research we want to find the genes that can be used to separate tissues of different classes (e.g. cancer and normal) from tens of thousands of genes that are active (expressed) in certain tissue cells. To this end, we wish to fit regression and classification models with a large number of features (also called variables, predictors). In the past decade, penalized likelihood methods for fitting regression models based on hyper-LASSO penalization have received increasing attention in the literature. However, fully Bayesian methods that use Markov chain Monte Carlo (MCMC) are still in lack of development in the literature. In this paper we introduce an MCMC (fully Bayesian) method for learning severely multi-modal posteriors of logistic regression models based on hyper-LASSO priors…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsLogistic Regression
