# Multilevel Monte Carlo Variational Inference

**Authors:** Masahiro Fujisawa, Issei Sato

arXiv: 1902.00468 · 2021-12-03

## TL;DR

This paper introduces a variance reduction framework for variational inference using Multilevel Monte Carlo, improving gradient estimation efficiency and convergence speed through adaptive sample sizing and gradient recycling.

## Contribution

It presents a novel MLMC-based variance reduction method for variational inference, with an adaptive SGD algorithm and theoretical analysis of variance reduction and convergence.

## Key findings

- Faster convergence compared to baseline methods.
- Reduced gradient estimator variance.
- Improved signal-to-noise ratio in gradient estimates.

## Abstract

We propose a variance reduction framework for variational inference using the Multilevel Monte Carlo (MLMC) method. Our framework is built on reparameterized gradient estimators and "recycles" parameters obtained from past update history in optimization. In addition, our framework provides a new optimization algorithm based on stochastic gradient descent (SGD) that adaptively estimates the sample size used for gradient estimation according to the ratio of the gradient variance. We theoretically show that, with our method, the variance of the gradient estimator decreases as optimization proceeds and that a learning rate scheduler function helps improve the convergence. We also show that, in terms of the \textit{signal-to-noise} ratio, our method can improve the quality of gradient estimation by the learning rate scheduler function without increasing the initial sample size. Finally, we confirm that our method achieves faster convergence and reduces the variance of the gradient estimator compared with other methods through experimental comparisons with baseline methods using several benchmark datasets.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1902.00468/full.md

## Figures

10 figures with captions in the complete paper: https://tomesphere.com/paper/1902.00468/full.md

## References

64 references — full list in the complete paper: https://tomesphere.com/paper/1902.00468/full.md

---
Source: https://tomesphere.com/paper/1902.00468