Mixture Models, Robustness, and Sum of Squares Proofs
Samuel B. Hopkins, Jerry Li

TL;DR
This paper introduces new algorithms using the Sum of Squares method for learning Gaussian mixtures and robust mean estimation in high dimensions, significantly improving statistical guarantees over previous methods.
Contribution
It presents the first efficient algorithms that surpass classical barriers for Gaussian mixture separation and approaches the information-theoretic limits in robust estimation.
Findings
Improved algorithm for Gaussian mixture separation at separation $k^{ ext{epsilon}}$
Robust mean estimation with error approaching the information-theoretic limit
Unified Sum of Squares based technique for high-dimensional distribution learning
Abstract
We use the Sum of Squares method to develop new efficient algorithms for learning well-separated mixtures of Gaussians and robust mean estimation, both in high dimensions, that substantially improve upon the statistical guarantees achieved by previous efficient algorithms. Firstly, we study mixtures of distributions in dimensions, where the means of every pair of distributions are separated by at least . In the special case of spherical Gaussian mixtures, we give a -time algorithm that learns the means assuming separation at least , for any . This is the first algorithm to improve on greedy ("single-linkage") and spectral clustering, breaking a long-standing barrier for efficient algorithms at separation . We also study robust estimation. When an unknown -fraction of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Mixture Models, Robustness, and Sum of Squares Proofs· youtube
Taxonomy
TopicsMachine Learning and Algorithms · Sparse and Compressive Sensing Techniques · Advanced Statistical Methods and Models
