Exponential expressivity in deep neural networks through transient chaos

Ben Poole; Subhaneil Lahiri; Maithra Raghu; Jascha Sohl-Dickstein,; Surya Ganguli

arXiv:1606.05340·stat.ML·June 22, 2016·160 cites

Exponential expressivity in deep neural networks through transient chaos

Ben Poole, Subhaneil Lahiri, Maithra Raghu, Jascha Sohl-Dickstein,, Surya Ganguli

PDF

Open Access 1 Repo

TL;DR

This paper combines Riemannian geometry and chaos theory to analyze deep neural networks, revealing an order-to-chaos transition that exponentially increases expressivity with depth and formalizing how deep networks disentangle complex input manifolds.

Contribution

It introduces a novel theoretical framework linking geometry and chaos to explain deep networks' exponential expressivity and their ability to disentangle complex manifolds.

Findings

01

Networks in the chaotic phase compute functions with exponentially growing curvature.

02

Deep networks cannot be efficiently approximated by shallow ones.

03

Deep networks can transform highly curved input manifolds into flatter hidden representations.

Abstract

We combine Riemannian geometry with the mean field theory of high dimensional chaos to study the nature of signal propagation in generic, deep neural networks with random weights. Our results reveal an order-to-chaos expressivity phase transition, with networks in the chaotic phase computing nonlinear functions whose global curvature grows exponentially with depth but not width. We prove this generic class of deep random functions cannot be efficiently computed by any shallow network, going beyond prior work restricted to the analysis of single functions. Moreover, we formalize and quantitatively demonstrate the long conjectured idea that deep networks can disentangle highly curved manifolds in input space into flat manifolds in hidden space. Our theoretical analysis of the expressive power of deep networks broadly applies to arbitrary nonlinearities, and provides a quantitative…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

ganguli-lab/deepchaos
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsModel Reduction and Neural Networks · Neural Networks and Applications · Neural dynamics and brain function