Randomly initialized EM algorithm for two-component Gaussian mixture   achieves near optimality in $O(\sqrt{n})$ iterations

Yihong Wu; Harrison H. Zhou

arXiv:1908.10935·math.ST·August 30, 2019·1 cites

Randomly initialized EM algorithm for two-component Gaussian mixture achieves near optimality in $O(\sqrt{n})$ iterations

Yihong Wu, Harrison H. Zhou

PDF

Open Access

TL;DR

This paper proves that the classical EM algorithm, when randomly initialized, converges efficiently to near-optimal estimates in symmetric two-component Gaussian mixtures without component separation, given sufficient sample size.

Contribution

It establishes near-optimal convergence rates for EM in Gaussian mixtures with minimal assumptions, improving upon prior results that required strong separation or initialization conditions.

Findings

01

EM converges in $O(\sqrt{n})$ iterations with high probability.

02

Estimates are within logarithmic factors of the minimax rate.

03

Results hold even with zero Fisher information in the worst case.

Abstract

We analyze the classical EM algorithm for parameter estimation in the symmetric two-component Gaussian mixtures in $d$ dimensions. We show that, even in the absence of any separation between components, provided that the sample size satisfies $n = Ω (d lo g^{3} d)$ , the randomly initialized EM algorithm converges to an estimate in at most $O (n)$ iterations with high probability, which is at most $O ((\frac{d l o g ^{3} n}{n})^{1/4})$ in Euclidean distance from the true parameter and within logarithmic factors of the minimax rate of $(\frac{d}{n})^{1/4}$ . Both the nonparametric statistical rate and the sublinear convergence rate are direct consequences of the zero Fisher information in the worst case. Refined pointwise guarantees beyond worst-case analysis and convergence to the MLE are also shown under mild conditions. This improves the previous result of Balakrishnan et al…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsBayesian Methods and Mixture Models · Machine Learning and Algorithms · Algorithms and Data Compression