The More, the Merrier: the Blessing of Dimensionality for Learning Large   Gaussian Mixtures

Joseph Anderson; Mikhail Belkin; Navin Goyal; Luis Rademacher; James; Voss

arXiv:1311.2891·cs.LG·February 19, 2014·31 cites

The More, the Merrier: the Blessing of Dimensionality for Learning Large Gaussian Mixtures

Joseph Anderson, Mikhail Belkin, Navin Goyal, Luis Rademacher, James, Voss

PDF

Open Access

TL;DR

This paper demonstrates that large Gaussian mixture models are efficiently learnable in high dimensions under certain conditions, revealing a surprising benefit of high dimensionality for statistical inference.

Contribution

It introduces a new Poissonization-based technique for learning large Gaussian mixtures in high dimensions and establishes the first exponential lower bounds for low-dimensional ICA.

Findings

01

High-dimensional Gaussian mixtures are polynomially learnable under non-degeneracy conditions.

02

A novel Poissonization technique transforms mixture learning into tensor decomposition and ICA problems.

03

Low-dimensional Gaussian mixtures are shown to be computationally hard to learn, with exponential lower bounds.

Abstract

In this paper we show that very large mixtures of Gaussians are efficiently learnable in high dimension. More precisely, we prove that a mixture with known identical covariance matrices whose number of components is a polynomial of any fixed degree in the dimension n is polynomially learnable as long as a certain non-degeneracy condition on the means is satisfied. It turns out that this condition is generic in the sense of smoothed complexity, as soon as the dimensionality of the space is high enough. Moreover, we prove that no such condition can possibly exist in low dimension and the problem of learning the parameters is generically hard. In contrast, much of the existing work on Gaussian Mixtures relies on low-dimensional projections and thus hits an artificial barrier. Our main result on mixture recovery relies on a new "Poissonization"-based technique, which transforms a mixture of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsBlind Source Separation Techniques · Machine Learning and Algorithms · Bayesian Methods and Mixture Models

MethodsIndependent Component Analysis