Pathwise Gradient Variance Reduction with Control Variates in Variational Inference
Kenyon Ng, Susan Wei

TL;DR
This paper reviews existing variance reduction techniques for pathwise gradient estimators in variational inference and introduces a new zero-variance control variate method that requires minimal assumptions about the variational distribution.
Contribution
The paper proposes a novel zero-variance control variate approach for pathwise gradient estimators that is broadly applicable and less assumption-dependent.
Findings
Existing control variates often rely on integrand approximations.
Current methods are limited to simple variational families.
The proposed method reduces variance with minimal assumptions.
Abstract
Variational inference in Bayesian deep learning often involves computing the gradient of an expectation that lacks a closed-form solution. In these cases, pathwise and score-function gradient estimators are the most common approaches. The pathwise estimator is often favoured for its substantially lower variance compared to the score-function estimator, which typically requires variance reduction techniques. However, recent research suggests that even pathwise gradient estimators could benefit from variance reduction. In this work, we review existing control-variates-based variance reduction methods for pathwise gradient estimators to assess their effectiveness. Notably, these methods often rely on integrand approximations and are applicable only to simple variational families. To address this limitation, we propose applying zero-variance control variates to pathwise gradient estimators.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImage and Signal Denoising Methods · Model Reduction and Neural Networks · Generative Adversarial Networks and Image Synthesis
