Hamiltonian Variational Auto-Encoder
Anthony L. Caterini, Arnaud Doucet, Dino Sejdinovic

TL;DR
This paper introduces the Hamiltonian Variational Auto-Encoder (HVAE), a novel approach that leverages Hamiltonian importance sampling to produce low-variance, unbiased estimators of the ELBO and its gradients, enhancing VAE inference.
Contribution
It presents a method to optimally select reverse kernels and build a low-variance estimator using Hamiltonian importance sampling, enabling efficient reparameterization in VAEs.
Findings
Provides a scheme for low-variance unbiased ELBO estimation
Enables reparameterization trick with Hamiltonian importance sampling
Develops a target-informed normalizing flow approach
Abstract
Variational Auto-Encoders (VAEs) have become very popular techniques to perform inference and learning in latent variable models as they allow us to leverage the rich representational power of neural networks to obtain flexible approximations of the posterior of latent variables as well as tight evidence lower bounds (ELBOs). Combined with stochastic variational inference, this provides a methodology scaling to large datasets. However, for this methodology to be practically efficient, it is necessary to obtain low-variance unbiased estimators of the ELBO and its gradients with respect to the parameters of interest. While the use of Markov chain Monte Carlo (MCMC) techniques such as Hamiltonian Monte Carlo (HMC) has been previously suggested to achieve this [23, 26], the proposed methods require specifying reverse kernels which have a large impact on performance. Additionally, the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsModel Reduction and Neural Networks · Gaussian Processes and Bayesian Inference · Generative Adversarial Networks and Image Synthesis
