Bayesian variable selection for high dimensional generalized linear models: convergence rates of the fitted densities
Wenxin Jiang

TL;DR
This paper demonstrates that Bayesian variable selection in high-dimensional generalized linear models can effectively reduce overfitting and achieve near-parametric convergence rates in the posterior density, even when the number of variables exceeds the sample size.
Contribution
It extends existing results by showing convergence rates of the fitted densities in high-dimensional GLMs under Bayesian variable selection, especially when most variables have negligible effects.
Findings
Posterior densities often close to true density in Hellinger distance
Convergence rate near the parametric rate of n^{-1/2}
Applicable when the number of variables exceeds sample size
Abstract
Bayesian variable selection has gained much empirical success recently in a variety of applications when the number of explanatory variables is possibly much larger than the sample size . For generalized linear models, if most of the 's have very small effects on the response , we show that it is possible to use Bayesian variable selection to reduce overfitting caused by the curse of dimensionality . In this approach a suitable prior can be used to choose a few out of the many 's to model , so that the posterior will propose probability densities that are ``often close'' to the true density in some sense. The closeness can be described by a Hellinger distance between and that scales at a power very close to , which is the ``finite-dimensional rate'' corresponding to a low-dimensional situation. These findings…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
