Fast Incremental Expectation Maximization for finite-sum optimization:   nonasymptotic convergence

Gersende Fort (IMT); P. Gach (IMT); E. Moulines (CMAP; XPOP)

arXiv:2012.14670·cs.LG·January 1, 2021

Fast Incremental Expectation Maximization for finite-sum optimization: nonasymptotic convergence

Gersende Fort (IMT), P. Gach (IMT), E. Moulines (CMAP, XPOP)

PDF

Open Access

TL;DR

This paper introduces nonasymptotic convergence bounds for the Fast Incremental Expectation Maximization (FIEM) algorithm, improving theoretical rates and providing practical strategies for large-scale finite-sum optimization problems.

Contribution

The paper recasts FIEM within a stochastic approximation framework and derives nonasymptotic convergence bounds, achieving better rates and practical strategies for large datasets.

Findings

01

Achieves convergence rate scaling as for xamples, better than previous n^{2/3} rate.

02

Provides two strategies for psilon-approximate stationary points with different iteration complexities.

03

Numerical results show improved step size choices and convergence control.

Abstract

Fast Incremental Expectation Maximization (FIEM) is a version of the EM framework for large datasets. In this paper, we first recast FIEM and other incremental EM type algorithms in the {\em Stochastic Approximation within EM} framework. Then, we provide nonasymptotic bounds for the convergence in expectation as a function of the number of examples $n$ and of the maximal number of iterations $\kmax$ . We propose two strategies for achieving an $ϵ$ -approximate stationary point, respectively with $\kmax = O (n^{2/3} / ϵ)$ and $\kmax = O (n / ϵ^{3/2})$ , both strategies relying on a random termination rule before $\kmax$ and on a constant step size in the Stochastic Approximation step. Our bounds provide some improvements on the literature. First, they allow $\kmax$ to scale as $n$ which is better than $n^{2/3}$ which was the best rate obtained so far; it is at…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStochastic Gradient Optimization Techniques · Statistical Methods and Inference · Gaussian Processes and Bayesian Inference