Alpha-Divergences in Variational Dropout

Bogdan Mazoure; Riashat Islam

arXiv:1711.04345·stat.ML·November 15, 2017·1 cites

Alpha-Divergences in Variational Dropout

Bogdan Mazoure, Riashat Islam

PDF

Open Access

TL;DR

This paper explores the use of Alpha-Divergences instead of KL divergence in variational dropout, demonstrating that Alpha-Divergences can improve training and inference in variational Bayesian models.

Contribution

It extends variational dropout to incorporate Alpha-Divergences, providing a new approach that can outperform standard methods in training neural networks.

Findings

01

Alpha-Divergences with alpha near 1 perform well in variational dropout.

02

Alpha-Divergences can yield lower training error than standard KL-based methods.

03

Using Alpha-Divergences offers a flexible alternative for variational inference.

Abstract

We investigate the use of alternative divergences to Kullback-Leibler (KL) in variational inference(VI), based on the Variational Dropout \cite{kingma2015}. Stochastic gradient variational Bayes (SGVB) \cite{aevb} is a general framework for estimating the evidence lower bound (ELBO) in Variational Bayes. In this work, we extend the SGVB estimator with using Alpha-Divergences, which are alternative to divergences to VI' KL objective. The Gaussian dropout can be seen as a local reparametrization trick of the SGVB objective. We extend the Variational Dropout to use alpha divergences for variational inference. Our results compare $α$ -divergence variational dropout with standard variational dropout with correlated and uncorrelated weight noise. We show that the $α$ -divergence with $α \to 1$ (or KL divergence) is still a good measure for use in variational inference, in…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Gaussian Processes and Bayesian Inference · Machine Learning and Data Classification

MethodsVariational Dropout · Dropout