Analytic natural gradient updates for Cholesky factor in Gaussian   variational approximation

Linda S. L. Tan

arXiv:2109.00375·stat.CO·February 5, 2025·1 cites

Analytic natural gradient updates for Cholesky factor in Gaussian variational approximation

Linda S. L. Tan

PDF

Open Access

TL;DR

This paper derives analytic natural gradient updates for the Cholesky factor in Gaussian variational inference, addressing positive definiteness and sparsity, and proposes a stochastic normalized natural gradient method for complex models.

Contribution

It introduces a novel analytic natural gradient update for the Cholesky factor, enabling efficient and stable optimization in high-dimensional Gaussian variational approximations.

Findings

01

Analytic natural gradient updates improve convergence stability.

02

Method ensures positive definiteness of covariance matrices.

03

Effective in generalized linear mixed models and neural networks.

Abstract

Natural gradients can improve convergence in stochastic variational inference significantly but inverting the Fisher information matrix is daunting in high dimensions. Moreover, in Gaussian variational approximation, natural gradient updates of the precision matrix do not ensure positive definiteness. To tackle this issue, we derive analytic natural gradient updates of the Cholesky factor of the covariance or precision matrix, and consider sparsity constraints representing different posterior correlation structures. Stochastic normalized natural gradient ascent with momentum is proposed for implementation in generalized linear mixed models and deep neural networks.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStatistical Methods and Inference · Neural Networks and Applications · Gaussian Processes and Bayesian Inference