Bayesian variable selection for high dimensional generalized linear   models: convergence rates of the fitted densities

Wenxin Jiang

arXiv:0710.3458·math.ST·September 29, 2009

Bayesian variable selection for high dimensional generalized linear models: convergence rates of the fitted densities

Wenxin Jiang

PDF

TL;DR

This paper demonstrates that Bayesian variable selection in high-dimensional generalized linear models can effectively reduce overfitting and achieve near-parametric convergence rates in the posterior density, even when the number of variables exceeds the sample size.

Contribution

It extends existing results by showing convergence rates of the fitted densities in high-dimensional GLMs under Bayesian variable selection, especially when most variables have negligible effects.

Findings

01

Posterior densities often close to true density in Hellinger distance

02

Convergence rate near the parametric rate of n^{-1/2}

03

Applicable when the number of variables exceeds sample size

Abstract

Bayesian variable selection has gained much empirical success recently in a variety of applications when the number $K$ of explanatory variables $(x_{1}, ..., x_{K})$ is possibly much larger than the sample size $n$ . For generalized linear models, if most of the $x_{j}$ 's have very small effects on the response $y$ , we show that it is possible to use Bayesian variable selection to reduce overfitting caused by the curse of dimensionality $K ≫ n$ . In this approach a suitable prior can be used to choose a few out of the many $x_{j}$ 's to model $y$ , so that the posterior will propose probability densities $p$ that are ``often close'' to the true density $p^{*}$ in some sense. The closeness can be described by a Hellinger distance between $p$ and $p^{*}$ that scales at a power very close to $n^{- 1/2}$ , which is the ``finite-dimensional rate'' corresponding to a low-dimensional situation. These findings…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.