Efficient Deep Learning of GMMs

Shirin Jalali; Carl Nuzman; Iraj Saniee

arXiv:1902.05707 · cs.LG · February 18, 2019 · 1 citation

TL;DR

This paper shows that a neural network with two hidden layers can optimally classify a collection of Gaussian mixture models (GMMs) in $R^{n}$ using $O(n)$ neurons, whereas a network with a single hidden layer would need a number of neurons exponential in $n$, or possibly exponentially large coefficients.

Contribution

The paper proves a depth separation for GMM classification: two hidden layers suffice with a neuron count linear in the dimension $n$, while one hidden layer requires exponentially many neurons or exponentially large coefficients.

Findings

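01. Networks with two hidden layers optimally classify GMMs in $R^{n}$ using $O(n)$ neurons.

02. Networks with a single hidden layer need a number of neurons exponential in $n$, or possibly exponentially large coefficients, for the same task.

03. Because Gaussian distributions are common in the feature spaces of speech, image, and text data, the result sheds light on the observed efficiency of deep networks in practical classification problems.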

Abstract

We show that a collection of Gaussian mixture models (GMMs) in $R^{n}$ can be optimally classified using $O(n)$ neurons in a neural network with two hidden layers (deep neural network), whereas in contrast, a neural network with a single hidden layer (shallow neural network) would require at least $O(\exp(n))$ neurons or possibly exponentially large coefficients. Given the universality of the Gaussian distribution in the feature spaces of data, e.g., in speech, image and text, our result sheds light on the observed efficiency of deep neural networks in practical classification problems.
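To make the classification task concrete, the sketch below implements the decision rule that "optimally classified" is presumably measured against: the Bayes (maximum a posteriori) rule, which assigns a point to the class whose GMM gives it the highest prior-weighted density. This is an illustration only, not the network construction from the paper. The function names `gmm_log_density` and `bayes_classify`, the isotropic covariance with a shared `sigma`, and the toy means are all assumptions made for the example.

```python
import numpy as np

def gmm_log_density(x, weights, means, sigma):
    """Log-density at x of an isotropic GMM (assumption: every component
    has covariance sigma^2 * I)."""
    n = x.shape[0]
    sq_dist = np.sum((means - x) ** 2, axis=1)        # squared distance to each mean
    log_terms = np.log(weights) - sq_dist / (2 * sigma**2)
    log_norm = -0.5 * n * np.log(2 * np.pi * sigma**2)
    m = np.max(log_terms)                             # log-sum-exp for numerical stability
    return log_norm + m + np.log(np.sum(np.exp(log_terms - m)))

def bayes_classify(x, classes, priors):
    """Return the index of the class with the largest log-posterior score."""
    scores = [np.log(p) + gmm_log_density(x, *c) for c, p in zip(classes, priors)]
    return int(np.argmax(scores))

# Toy setup (hypothetical): two classes in R^50, each a two-component GMM.
n = 50
e1, e2 = np.eye(n)[0], np.eye(n)[1]
classes = [
    (np.array([0.5, 0.5]), np.stack([3 * e1, -3 * e1]), 1.0),  # class 0
    (np.array([0.5, 0.5]), np.stack([3 * e2, -3 * e2]), 1.0),  # class 1
]
priors = [0.5, 0.5]

print(bayes_classify(3 * e1, classes, priors))   # point at a class-0 mean
print(bayes_classify(-3 * e2, classes, priors))  # point at a class-1 mean
```

Each class score is a log of a sum of exponentials of quadratic forms in the input, which is the kind of function the abstract says two hidden layers can represent with $O(n)$ neurons.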

Taxonomy

Topics: Bayesian Methods and Mixture Models · Speech Recognition and Synthesis · Music and Audio Processing