Fully Bayesian Logistic Regression with Hyper-Lasso Priors for   High-dimensional Feature Selection

Longhai Li; Weixin Yao

arXiv:1405.3319·stat.CO·July 20, 2018

Fully Bayesian Logistic Regression with Hyper-Lasso Priors for High-dimensional Feature Selection

Longhai Li, Weixin Yao

PDF

1 Repo

TL;DR

This paper introduces a fully Bayesian MCMC method for high-dimensional logistic regression using hyper-LASSO priors, effectively handling multi-modal posteriors for feature selection in complex datasets.

Contribution

It develops a novel Hamiltonian Monte Carlo-based Bayesian approach for hyper-LASSO penalized logistic regression, addressing a gap in fully Bayesian high-dimensional feature selection methods.

Findings

01

Hyper-LASSO priors outperform traditional penalties in simulations.

02

The method effectively identifies relevant features in real genomic data.

03

Hyper-LASSO prior parameters influence model sparsity and performance.

Abstract

High-dimensional feature selection arises in many areas of modern science. For example, in genomic research we want to find the genes that can be used to separate tissues of different classes (e.g. cancer and normal) from tens of thousands of genes that are active (expressed) in certain tissue cells. To this end, we wish to fit regression and classification models with a large number of features (also called variables, predictors). In the past decade, penalized likelihood methods for fitting regression models based on hyper-LASSO penalization have received increasing attention in the literature. However, fully Bayesian methods that use Markov chain Monte Carlo (MCMC) are still in lack of development in the literature. In this paper we introduce an MCMC (fully Bayesian) method for learning severely multi-modal posteriors of logistic regression models based on hyper-LASSO priors…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

longhaiSK/HTLR
none

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsLogistic Regression