A Class of Two-Timescale Stochastic EM Algorithms for Nonconvex Latent   Variable Models

Belhal Karimi; Ping Li

arXiv:2203.10186·stat.ML·March 22, 2022

A Class of Two-Timescale Stochastic EM Algorithms for Nonconvex Latent Variable Models

Belhal Karimi, Ping Li

PDF

Open Access

TL;DR

This paper introduces a novel class of Two-Timescale Stochastic EM algorithms designed for nonconvex latent variable models, combining variance reduction techniques to improve convergence and scalability.

Contribution

It proposes a general Two-Timescale EM framework with convergence guarantees, addressing nonconvexity and large datasets in latent variable model learning.

Findings

01

Finite-time convergence bounds established

02

Effective variance reduction demonstrated

03

Numerical experiments show improved performance

Abstract

The Expectation-Maximization (EM) algorithm is a popular choice for learning latent variable models. Variants of the EM have been initially introduced, using incremental updates to scale to large datasets, and using Monte Carlo (MC) approximations to bypass the intractable conditional expectation of the latent data for most nonconvex models. In this paper, we propose a general class of methods called Two-Timescale EM Methods based on a two-stage approach of stochastic updates to tackle an essential nonconvex optimization task for latent variable models. We motivate the choice of a double dynamic by invoking the variance reduction virtue of each stage of the method on both sources of noise: the index sampling for the incremental update and the MC approximation. We establish finite-time and global convergence bounds for nonconvex objective functions. Numerical applications on various…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStatistical Methods and Inference · Machine Learning and Algorithms · Markov Chains and Monte Carlo Methods