On Learning Mixtures of Well-Separated Gaussians

Oded Regev; Aravindan Vijayaraghavan

arXiv:1710.11592 · cs.DS · November 1, 2017

TL;DR

This paper asks how far apart the means of a mixture of spherical Gaussians must be for efficient learning, and shows that separation of order √log k is both necessary and sufficient for learning with polynomially many samples.

Contribution

The authors prove lower and upper bounds on the separation required to learn Gaussian mixtures with polynomially many samples, and give an accuracy boosting algorithm that refines coarse estimates of the means to arbitrary accuracy.

Findings

01

With separation o(√log k), super-polynomially many samples are required.

02

With separation Ω(√log k), polynomially many samples suffice.

03

An efficient accuracy boosting algorithm achieves arbitrary precision estimates.
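
To get a feel for the gap these findings close, the following minimal sketch compares the √log k scale with the min{k, d}^{1/4} separation that the abstract attributes to Vempala and Wang. The function names are hypothetical, the hidden constants in both bounds are set to 1 (an assumption, since the asymptotic statements do not fix them), and d = k is chosen only for illustration.

```python
import math


def sqrt_log_scale(k: int) -> float:
    """sqrt(ln k), with the hidden constant assumed to be 1."""
    return math.sqrt(math.log(k))


def fourth_root_scale(k: int, d: int) -> float:
    """min{k, d}^(1/4), with the hidden constant assumed to be 1."""
    return min(k, d) ** 0.25


# Print both scales side by side for a few mixture sizes, taking d = k.
for k in (10, 1_000, 1_000_000):
    d = k
    print(f"k={k:>9}  sqrt(log k)={sqrt_log_scale(k):6.2f}  "
          f"min(k,d)^(1/4)={fourth_root_scale(k, d):6.2f}")
```

For k = d = 10^6 the two scales are roughly 3.7 and 31.6, so the difference grows quickly with the number of components, although the true constants could shift the comparison for small k.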

Abstract

We consider the problem of efficiently learning mixtures of a large number of spherical Gaussians, when the components of the mixture are well separated. In the most basic form of this problem, we are given samples from a uniform mixture of $k$ standard spherical Gaussians, and the goal is to estimate the means up to accuracy $\delta$ using $\mathrm{poly}(k, d, 1/\delta)$ samples. In this work, we study the following question: what is the minimum separation needed between the means for solving this task? The best known algorithm due to Vempala and Wang [JCSS 2004] requires a separation of roughly $\min\{k, d\}^{1/4}$. On the other hand, Moitra and Valiant [FOCS 2010] showed that with separation $o(1)$, exponentially many samples are required. We address the significant gap between these two bounds, by showing the following results. 1. We show that with separation $o(\sqrt{\log k})$,…
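
The basic problem setup described in the abstract can be sketched as follows: draw samples from a uniform mixture of k standard spherical Gaussians in d dimensions and measure the minimum distance between the means. This is only an illustration of the input model, not the paper's algorithm. The helper names, the choice of scaled basis vectors as means, and the constant 4 in the separation are all assumptions made for the example.

```python
import math
import random


def sample_uniform_mixture(means, n, rng):
    """Draw n points: pick a component uniformly, then add N(0, I) noise."""
    samples, labels = [], []
    for _ in range(n):
        j = rng.randrange(len(means))
        samples.append([m + rng.gauss(0.0, 1.0) for m in means[j]])
        labels.append(j)
    return samples, labels


def min_separation(means):
    """Smallest Euclidean distance between any two distinct means."""
    return min(
        math.dist(a, b)
        for i, a in enumerate(means)
        for b in means[i + 1:]
    )


rng = random.Random(0)
k, d = 8, 16  # hypothetical sizes; this construction needs d >= k

# Target separation on the sqrt(log k) scale; the factor 4 is arbitrary.
sep = 4.0 * math.sqrt(math.log(k))

# Scaled standard basis vectors: every pair of means is exactly `sep` apart.
means = [
    [sep / math.sqrt(2) if i == j else 0.0 for j in range(d)]
    for i in range(k)
]

samples, labels = sample_uniform_mixture(means, 2000, rng)
print(f"minimum separation between means: {min_separation(means):.3f}")
```

A learning algorithm in this setting would see only `samples`; the `labels` are kept here so that an estimate of the means can be checked against the ground truth.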
