Mixture Models, Robustness, and Sum of Squares Proofs

Samuel B. Hopkins; Jerry Li

arXiv:1711.07454·cs.DS·November 21, 2017·5 cites

Mixture Models, Robustness, and Sum of Squares Proofs

Samuel B. Hopkins, Jerry Li

PDF

Open Access 1 Video

TL;DR

This paper introduces new algorithms using the Sum of Squares method for learning Gaussian mixtures and robust mean estimation in high dimensions, significantly improving statistical guarantees over previous methods.

Contribution

It presents the first efficient algorithms that surpass classical barriers for Gaussian mixture separation and approaches the information-theoretic limits in robust estimation.

Findings

01

Improved algorithm for Gaussian mixture separation at separation $k^{ ext{epsilon}}$

02

Robust mean estimation with error approaching the information-theoretic limit

03

Unified Sum of Squares based technique for high-dimensional distribution learning

Abstract

We use the Sum of Squares method to develop new efficient algorithms for learning well-separated mixtures of Gaussians and robust mean estimation, both in high dimensions, that substantially improve upon the statistical guarantees achieved by previous efficient algorithms. Firstly, we study mixtures of $k$ distributions in $d$ dimensions, where the means of every pair of distributions are separated by at least $k^{ε}$ . In the special case of spherical Gaussian mixtures, we give a $(d k)^{O (1/ ε^{2})}$ -time algorithm that learns the means assuming separation at least $k^{ε}$ , for any $ε > 0$ . This is the first algorithm to improve on greedy ("single-linkage") and spectral clustering, breaking a long-standing barrier for efficient algorithms at separation $k^{1/4}$ . We also study robust estimation. When an unknown $(1 - ε)$ -fraction of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Mixture Models, Robustness, and Sum of Squares Proofs· youtube

Taxonomy

TopicsMachine Learning and Algorithms · Sparse and Compressive Sensing Techniques · Advanced Statistical Methods and Models