An Ensemble EM Algorithm for Bayesian Variable Selection

Jin Wang; Feng Liang; Yuan Ji

arXiv:1603.04360·stat.CO·March 15, 2016·1 cites

An Ensemble EM Algorithm for Bayesian Variable Selection

Jin Wang, Feng Liang, Yuan Ji

PDF

Open Access

TL;DR

This paper introduces an efficient ensemble EM algorithm for Bayesian variable selection in linear regression, capable of handling large datasets and achieving consistent variable selection even as the number of variables grows.

Contribution

It proposes a novel ensemble EM algorithm that improves scalability and accuracy in Bayesian variable selection, addressing local optima issues.

Findings

01

Algorithm scales efficiently with big data

02

Achieves variable selection consistency as p diverges with n

03

Demonstrates superior empirical performance

Abstract

We study the Bayesian approach to variable selection in the context of linear regression. Motivated by a recent work by Rockova and George (2014), we propose an EM algorithm that returns the MAP estimate of the set of relevant variables. Due to its particular updating scheme, our algorithm can be implemented efficiently without inverting a large matrix in each iteration and therefore can scale up with big data. We also show that the MAP estimate returned by our EM algorithm achieves variable selection consistency even when $p$ diverges with $n$ . In practice, our algorithm could get stuck with local modes, a common problem with EM algorithms. To address this issue, we propose an ensemble EM algorithm, in which we repeatedly apply the EM algorithm on a subset of the samples with a subset of the covariates, and then aggregate the variable selection results across those bootstrap…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsBayesian Methods and Mixture Models · Statistical Methods and Inference · Statistical Methods and Bayesian Inference