Alpha-Divergences in Variational Dropout
Bogdan Mazoure, Riashat Islam

TL;DR
This paper explores the use of Alpha-Divergences instead of KL divergence in variational dropout, demonstrating that Alpha-Divergences can improve training and inference in variational Bayesian models.
Contribution
It extends variational dropout to incorporate Alpha-Divergences, providing a new approach that can outperform standard methods in training neural networks.
Findings
Alpha-Divergences with alpha near 1 perform well in variational dropout.
Alpha-Divergences can yield lower training error than standard KL-based methods.
Using Alpha-Divergences offers a flexible alternative for variational inference.
Abstract
We investigate the use of alternative divergences to Kullback-Leibler (KL) in variational inference(VI), based on the Variational Dropout \cite{kingma2015}. Stochastic gradient variational Bayes (SGVB) \cite{aevb} is a general framework for estimating the evidence lower bound (ELBO) in Variational Bayes. In this work, we extend the SGVB estimator with using Alpha-Divergences, which are alternative to divergences to VI' KL objective. The Gaussian dropout can be seen as a local reparametrization trick of the SGVB objective. We extend the Variational Dropout to use alpha divergences for variational inference. Our results compare -divergence variational dropout with standard variational dropout with correlated and uncorrelated weight noise. We show that the -divergence with (or KL divergence) is still a good measure for use in variational inference, in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Gaussian Processes and Bayesian Inference · Machine Learning and Data Classification
MethodsVariational Dropout · Dropout
