Graph Reparameterizations for Enabling 1000+ Monte Carlo Iterations in Bayesian Deep Neural Networks
Jurijs Nazarovs, Ronak R. Mehta, Vishnu Suresh Lokhande, Vikas Singh

TL;DR
This paper introduces a framework for reparameterizing distributions in Bayesian neural networks, enabling over 1000 Monte Carlo iterations efficiently, which improves uncertainty estimation and training stability in large models.
Contribution
The authors develop a novel framework for distribution reparameterizations that decouples computation graph size from the number of MC samples, allowing scalable uncertainty estimation.
Findings
Enabling over 1000 MC iterations improves model confidence.
Reparameterizations lead to more stable training and better performance.
Framework applies to large architectures in computer vision.
Abstract
Uncertainty estimation in deep models is essential in many real-world applications and has benefited from developments over the last several years. Recent evidence suggests that existing solutions dependent on simple Gaussian formulations may not be sufficient. However, moving to other distributions necessitates Monte Carlo (MC) sampling to estimate quantities such as the KL divergence: it could be expensive and scales poorly as the dimensions of both the input data and the model grow. This is directly related to the structure of the computation graph, which can grow linearly as a function of the number of MC samples needed. Here, we construct a framework to describe these computation graphs, and identify probability families where the graph size can be independent or only weakly dependent on the number of MC samples. These families correspond directly to large classes of distributions.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Bayesian Modeling and Causal Inference · Gaussian Processes and Bayesian Inference
