# Monte Carlo Co-Ordinate Ascent Variational Inference

**Authors:** Lifeng Ye, Alexandros Beskos, Maria De Iorio, and Jie Hao

arXiv: 1905.03760 · 2019-10-18

## TL;DR

This paper introduces MC-CAVI, a Monte Carlo-based coordinate ascent variational inference algorithm that effectively handles complex models with hard constraints, demonstrating competitive performance with MCMC and BBVI in various applications.

## Contribution

The paper proposes MC-CAVI, integrating MCMC into coordinate ascent VI, enabling efficient inference in constrained complex models where traditional methods struggle.

## Key findings

- MC-CAVI converges to the ELBO maximizer with high probability.
- MC-CAVI performs comparably to MCMC in complex NMR models.
- MC-CAVI is suitable for models with hard constraints, outperforming BBVI in such settings.

## Abstract

In Variational Inference (VI), coordinate-ascent and gradient-based approaches are two major types of algorithms for approximating difficult-to-compute probability densities. In real-world implementations of complex models, Monte Carlo methods are widely used to estimate expectations in coordinate-ascent approaches and gradients in derivative-driven ones. We discuss a Monte Carlo Co-ordinate Ascent VI (MC-CAVI) algorithm that makes use of Markov chain Monte Carlo (MCMC) methods in the calculation of expectations required within Co-ordinate Ascent VI (CAVI). We show that, under regularity conditions, an MC-CAVI recursion will get arbitrarily close to a maximiser of the evidence lower bound (ELBO) with any given high probability. In numerical examples, the performance of MC-CAVI algorithm is compared with that of MCMC and -- as a representative of derivative-based VI methods -- of Black Box VI (BBVI). We discuss and demonstrate MC-CAVI's suitability for models with hard constraints in simulated and real examples. We compare MC-CAVI's performance with that of MCMC in an important complex model used in Nuclear Magnetic Resonance (NMR) spectroscopy data analysis -- BBVI is nearly impossible to be employed in this setting due to the hard constraints involved in the model.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1905.03760/full.md

## Figures

35 figures with captions in the complete paper: https://tomesphere.com/paper/1905.03760/full.md

## References

29 references — full list in the complete paper: https://tomesphere.com/paper/1905.03760/full.md

---
Source: https://tomesphere.com/paper/1905.03760