Sharp Analysis of Expectation-Maximization for Weakly Identifiable   Models

Raaz Dwivedi; Nhat Ho; Koulik Khamaru; Martin J. Wainwright; Michael; I. Jordan; Bin Yu

arXiv:1902.00194·math.ST·November 17, 2021·5 cites

Sharp Analysis of Expectation-Maximization for Weakly Identifiable Models

Raaz Dwivedi, Nhat Ho, Koulik Khamaru, Martin J. Wainwright, Michael, I. Jordan, Bin Yu

PDF

Open Access

TL;DR

This paper rigorously analyzes the slow convergence of the EM algorithm in weakly identifiable Gaussian mixture models, revealing convergence rates of order $n^{3/4}$ steps and estimation errors of order $n^{-1/8}$ and $n^{-1/4}$ in univariate cases.

Contribution

It provides the first detailed characterization of EM's slow convergence in weakly identifiable models, introducing a novel two-stage localization argument and extending results to multivariate cases.

Findings

01

EM converges in order $n^{3/4}$ steps in univariate models.

02

Estimates are within $n^{-1/8}$ and $n^{-1/4}$ of true parameters.

03

Similar slow rates are observed in multivariate models.

Abstract

We study a class of weakly identifiable location-scale mixture models for which the maximum likelihood estimates based on $n$ i.i.d. samples are known to have lower accuracy than the classical $n^{- \frac{1}{2}}$ error. We investigate whether the Expectation-Maximization (EM) algorithm also converges slowly for these models. We provide a rigorous characterization of EM for fitting a weakly identifiable Gaussian mixture in a univariate setting where we prove that the EM algorithm converges in order $n^{\frac{3}{4}}$ steps and returns estimates that are at a Euclidean distance of order $n^{- \frac{1}{8}}$ and $n^{- \frac{1}{4}}$ from the true location and scale parameter respectively. Establishing the slow rates in the univariate setting requires a novel localization argument with two stages, with each stage involving an epoch-based argument applied to a different surrogate EM…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsBayesian Methods and Mixture Models · Statistical Methods and Bayesian Inference · Statistical Methods and Inference