Variational Bayesian dropout: pitfalls and fixes
Jiri Hron, Alexander G. de G. Matthews, Zoubin Ghahramani

TL;DR
This paper critically examines the Bayesian dropout framework, identifies fundamental issues with improper priors and singularities, and proposes Quasi-KL divergence as a novel solution to improve variational inference in neural networks.
Contribution
It highlights the pitfalls of existing Bayesian dropout methods and introduces Quasi-KL divergence to address the singularity problem in variational inference.
Findings
The variational Gaussian dropout framework suffers from irredeemable pathologies due to improper priors.
QKL divergence provides a well-defined objective for high-dimensional distribution approximation.
QKL converges to variational Bernoulli dropout under certain limits, connecting discretisation and noise motivations.
Abstract
Dropout, a stochastic regularisation technique for training of neural networks, has recently been reinterpreted as a specific type of approximate inference algorithm for Bayesian neural networks. The main contribution of the reinterpretation is in providing a theoretical framework useful for analysing and extending the algorithm. We show that the proposed framework suffers from several issues; from undefined or pathological behaviour of the true posterior related to use of improper priors, to an ill-defined variational objective due to singularity of the approximating distribution relative to the true posterior. Our analysis of the improper log uniform prior used in variational Gaussian dropout suggests the pathologies are generally irredeemable, and that the algorithm still works only because the variational formulation annuls some of the pathologies. To address the singularity issue,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGaussian Processes and Bayesian Inference · Blind Source Separation Techniques · Bayesian Modeling and Causal Inference
MethodsDropout
