Approximation Based Variance Reduction for Reparameterization Gradients

Tomas Geffner; Justin Domke

arXiv:2007.14634 · cs.LG · October 26, 2020

TL;DR

This paper introduces a control variate for reparameterization gradients, built from a quadratic approximation of the model, that substantially reduces gradient variance and improves optimization convergence in variational inference with non-factorized variational distributions.

Contribution

The paper proposes a control variate that applies to any reparameterizable distribution with known mean and covariance matrix (for example, Gaussians with any covariance structure), lowering the variance of the gradient estimator.

Findings

01 Large variance reduction in gradient estimates

02 Improved convergence in variational inference

03 Effective for non-factorized variational distributions

Abstract

Flexible variational distributions improve variational inference but are harder to optimize. In this work we present a control variate that is applicable for any reparameterizable distribution with known mean and covariance matrix, e.g. Gaussians with any covariance structure. The control variate is based on a quadratic approximation of the model, and its parameters are set using a double-descent scheme by minimizing the gradient estimator's variance. We empirically show that this control variate leads to large improvements in gradient variance and optimization convergence for inference with non-factorized variational distributions.
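
The idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation and makes several simplifying assumptions: only the gradient with respect to the mean of a Gaussian is estimated, the toy target `f`, the helper names, and all numeric values are hypothetical, and the quadratic approximation's matrix `B` is fixed to the exact Hessian at the mean instead of being learned by the double-descent scheme the abstract mentions. With z = mu + L eps and a quadratic approximation expanded at mu, the approximation's gradient minus its closed-form expectation is B L eps, which has zero mean and can be subtracted from the plain estimator.

```python
import numpy as np

def grad_f(z, A):
    # Gradient of the toy target f(z) = 0.5 z^T A z + sum(z^4) / 4,
    # evaluated row-wise on a batch of samples z (A is symmetric).
    return z @ A + z ** 3

def mean_gradient_estimates(mu, L, A, B, eps):
    # Reparameterize: z = mu + L eps with eps ~ N(0, I).
    delta = eps @ L.T
    z = mu + delta
    plain = grad_f(z, A)      # per-sample reparameterization gradient wrt mu
    # Gradient of the quadratic approximation minus its expectation: B (z - mu).
    control = delta @ B.T     # zero mean by construction
    return plain, plain - control

rng = np.random.default_rng(0)
d = 5
A = np.eye(d) + 0.1 * np.ones((d, d))                           # symmetric matrix
mu = np.linspace(-1.0, 1.0, d)                                  # variational mean
L = 0.3 * np.eye(d) + 0.05 * np.tril(rng.normal(size=(d, d)))   # non-diagonal scale factor
B = A + np.diag(3.0 * mu ** 2)                                  # Hessian of f at mu (assumption: not learned)

eps = rng.normal(size=(20000, d))
plain, controlled = mean_gradient_estimates(mu, L, A, B, eps)

var_plain = plain.var(axis=0).sum()      # total variance of the plain estimator
var_cv = controlled.var(axis=0).sum()    # total variance with the control variate
print(f"plain: {var_plain:.4f}  with control variate: {var_cv:.4f}")
```

Both estimators target the same expected gradient. The control variate removes the part of the per-sample noise that the quadratic approximation explains, so the remaining variance comes only from the higher-order terms of `f`.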

Code & Models

Repositories

tomsons22/ABVRR (PyTorch)

Videos

Approximation Based Variance Reduction for Reparameterization Gradients · SlidesLive

Taxonomy

Topics: Gaussian Processes and Bayesian Inference · Model Reduction and Neural Networks · Neural Networks and Applications