Learning Mixtures of Gaussians in High Dimensions
Rong Ge, Qingqing Huang, Sham M. Kakade

TL;DR
This paper presents a polynomial-time algorithm for learning Gaussian mixture models in high dimensions under a smoothed analysis framework, overcoming worst-case hardness by leveraging tensor decomposition and structured random matrix analysis.
Contribution
It introduces a novel high-dimensional learning algorithm for Gaussian mixtures under smoothed analysis, utilizing tensor decomposition and new bounds on structured random matrices.
Findings
Learned Gaussian mixtures with polynomial complexity when n > Ω(k^2).
Developed new tensor decomposition techniques exploiting Gaussian moment symmetries.
Established bounds on smallest singular values of structured random matrices.
Abstract
Efficiently learning mixture of Gaussians is a fundamental problem in statistics and learning theory. Given samples coming from a random one out of k Gaussian distributions in Rn, the learning problem asks to estimate the means and the covariance matrices of these Gaussians. This learning problem arises in many areas ranging from the natural sciences to the social sciences, and has also found many machine learning applications. Unfortunately, learning mixture of Gaussians is an information theoretically hard problem: in order to learn the parameters up to a reasonable accuracy, the number of samples required is exponential in the number of Gaussian components in the worst case. In this work, we show that provided we are in high enough dimensions, the class of Gaussian mixtures is learnable in its most general form under a smoothed analysis framework, where the parameters are randomly…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTensor decomposition and applications · Sparse and Compressive Sensing Techniques · Machine Learning and Algorithms
