The More, the Merrier: the Blessing of Dimensionality for Learning Large Gaussian Mixtures
Joseph Anderson, Mikhail Belkin, Navin Goyal, Luis Rademacher, James, Voss

TL;DR
This paper demonstrates that large Gaussian mixture models are efficiently learnable in high dimensions under certain conditions, revealing a surprising benefit of high dimensionality for statistical inference.
Contribution
It introduces a new Poissonization-based technique for learning large Gaussian mixtures in high dimensions and establishes the first exponential lower bounds for low-dimensional ICA.
Findings
High-dimensional Gaussian mixtures are polynomially learnable under non-degeneracy conditions.
A novel Poissonization technique transforms mixture learning into tensor decomposition and ICA problems.
Low-dimensional Gaussian mixtures are shown to be computationally hard to learn, with exponential lower bounds.
Abstract
In this paper we show that very large mixtures of Gaussians are efficiently learnable in high dimension. More precisely, we prove that a mixture with known identical covariance matrices whose number of components is a polynomial of any fixed degree in the dimension n is polynomially learnable as long as a certain non-degeneracy condition on the means is satisfied. It turns out that this condition is generic in the sense of smoothed complexity, as soon as the dimensionality of the space is high enough. Moreover, we prove that no such condition can possibly exist in low dimension and the problem of learning the parameters is generically hard. In contrast, much of the existing work on Gaussian Mixtures relies on low-dimensional projections and thus hits an artificial barrier. Our main result on mixture recovery relies on a new "Poissonization"-based technique, which transforms a mixture of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBlind Source Separation Techniques · Machine Learning and Algorithms · Bayesian Methods and Mixture Models
MethodsIndependent Component Analysis
