Second order stochastic gradient update for Cholesky factor in Gaussian   variational approximation from Stein's Lemma

Linda S. L. Tan

arXiv:2210.10566·stat.ME·October 20, 2022

Second order stochastic gradient update for Cholesky factor in Gaussian variational approximation from Stein's Lemma

Linda S. L. Tan

PDF

Open Access

TL;DR

This paper introduces a second order stochastic gradient method for updating the Cholesky factor in Gaussian variational inference, leveraging Stein's Lemma to improve convergence and reduce variance.

Contribution

It derives a novel second order unbiased gradient estimate for the Cholesky factor using Stein's Lemma, enhancing variational inference updates.

Findings

01

Second order updates reduce variance near the mode.

02

Method improves convergence speed in Gaussian variational inference.

03

Applicable to sparse precision matrices with conditional independence.

Abstract

In stochastic variational inference, use of the reparametrization trick for the multivariate Gaussian gives rise to efficient updates for the mean and Cholesky factor of the covariance matrix, which depend on the first order derivative of the log joint model density. In this article, we show that an alternative unbiased gradient estimate for the Cholesky factor which depends on the second order derivative of the log joint model density can be derived using Stein's Lemma. This leads to a second order stochastic gradient update for the Cholesky factor which is able to improve convergence, as it has variance lower than the first order update (almost negligible) when close to the mode. We also derive second order update for the Cholesky factor of the precision matrix, which is useful when the precision matrix has a sparse structure reflecting conditional independence in the true posterior…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStatistical Methods and Bayesian Inference · Markov Chains and Monte Carlo Methods · Bayesian Methods and Mixture Models