Fast Incremental Expectation Maximization for finite-sum optimization: nonasymptotic convergence
Gersende Fort (IMT), P. Gach (IMT), E. Moulines (CMAP, XPOP)

TL;DR
This paper introduces nonasymptotic convergence bounds for the Fast Incremental Expectation Maximization (FIEM) algorithm, improving theoretical rates and providing practical strategies for large-scale finite-sum optimization problems.
Contribution
The paper recasts FIEM within a stochastic approximation framework and derives nonasymptotic convergence bounds, achieving better rates and practical strategies for large datasets.
Findings
Achieves convergence rate scaling as for xamples, better than previous n^{2/3} rate.
Provides two strategies for psilon-approximate stationary points with different iteration complexities.
Numerical results show improved step size choices and convergence control.
Abstract
Fast Incremental Expectation Maximization (FIEM) is a version of the EM framework for large datasets. In this paper, we first recast FIEM and other incremental EM type algorithms in the {\em Stochastic Approximation within EM} framework. Then, we provide nonasymptotic bounds for the convergence in expectation as a function of the number of examples and of the maximal number of iterations . We propose two strategies for achieving an -approximate stationary point, respectively with and , both strategies relying on a random termination rule before and on a constant step size in the Stochastic Approximation step. Our bounds provide some improvements on the literature. First, they allow to scale as which is better than which was the best rate obtained so far; it is at…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Statistical Methods and Inference · Gaussian Processes and Bayesian Inference
