From the Expectation Maximisation Algorithm to Autoencoded Variational Bayes
Graham W. Pulford

TL;DR
This paper provides a comprehensive tutorial linking the classical EM algorithm to modern autoencoded variational Bayes and variational autoencoders, clarifying key concepts and their evolution.
Contribution
It offers a detailed, unified explanation of EM, variational Bayesian inference, and autoencoders, highlighting their connections and clarifying the meaning of latent variables.
Findings
Clarifies the relationship between EM and variational inference.
Explains the reparametrisation trick in detail.
Provides numerical examples illustrating the algorithms.
Abstract
Although the expectation maximisation (EM) algorithm was introduced in 1970, it remains somewhat inaccessible to machine learning practitioners due to its obscure notation, terse proofs and lack of concrete links to modern machine learning techniques like autoencoded variational Bayes. This has resulted in gaps in the AI literature concerning the meaning of such concepts like "latent variables" and "variational lower bound," which are frequently used but often not clearly explained. The roots of these ideas lie in the EM algorithm. We first give a tutorial presentation of the EM algorithm for estimating the parameters of a -component mixture density. The Gaussian mixture case is presented in detail using -ary scalar hidden (or latent) variables rather than the more traditional binary valued -dimenional vectors. This presentation is motivated by mixture modelling from the target…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGaussian Processes and Bayesian Inference · Bayesian Methods and Mixture Models · Machine Learning and Algorithms
