Variational Tempering
Stephan Mandt, James McInerney, Farhan Abrol, Rajesh Ranganath, and, David Blei

TL;DR
This paper introduces variational tempering, an adaptive annealing method for variational inference that uses latent temperature variables to improve optimization and predictive performance on large datasets.
Contribution
The paper proposes variational tempering with adaptive and local annealing schedules, enhancing variational inference for better posterior approximation.
Findings
Improved predictive likelihoods on held-out data.
Adaptive annealing schedules outperform fixed schedules.
Local tempering varies temperature across data points.
Abstract
Variational inference (VI) combined with data subsampling enables approximate posterior inference over large data sets, but suffers from poor local optima. We first formulate a deterministic annealing approach for the generic class of conditionally conjugate exponential family models. This approach uses a decreasing temperature parameter which deterministically deforms the objective during the course of the optimization. A well-known drawback to this annealing approach is the choice of the cooling schedule. We therefore introduce variational tempering, a variational algorithm that introduces a temperature latent variable to the model. In contrast to related work in the Markov chain Monte Carlo literature, this algorithm results in adaptive annealing schedules. Lastly, we develop local variational tempering, which assigns a latent temperature to each data point; this allows for dynamic…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Algorithms · Gaussian Processes and Bayesian Inference · Statistical Methods and Inference
