A Class of Two-Timescale Stochastic EM Algorithms for Nonconvex Latent Variable Models
Belhal Karimi, Ping Li

TL;DR
This paper introduces a novel class of Two-Timescale Stochastic EM algorithms designed for nonconvex latent variable models, combining variance reduction techniques to improve convergence and scalability.
Contribution
It proposes a general Two-Timescale EM framework with convergence guarantees, addressing nonconvexity and large datasets in latent variable model learning.
Findings
Finite-time convergence bounds established
Effective variance reduction demonstrated
Numerical experiments show improved performance
Abstract
The Expectation-Maximization (EM) algorithm is a popular choice for learning latent variable models. Variants of the EM have been initially introduced, using incremental updates to scale to large datasets, and using Monte Carlo (MC) approximations to bypass the intractable conditional expectation of the latent data for most nonconvex models. In this paper, we propose a general class of methods called Two-Timescale EM Methods based on a two-stage approach of stochastic updates to tackle an essential nonconvex optimization task for latent variable models. We motivate the choice of a double dynamic by invoking the variance reduction virtue of each stage of the method on both sources of noise: the index sampling for the incremental update and the MC approximation. We establish finite-time and global convergence bounds for nonconvex objective functions. Numerical applications on various…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Methods and Inference · Machine Learning and Algorithms · Markov Chains and Monte Carlo Methods
