On the number of response regions of deep feed forward networks with piece-wise linear activations

Razvan Pascanu; Guido Montufar; Yoshua Bengio

arXiv:1312.6098·cs.LG·February 17, 2014·ICLR·126 cites

TL;DR

This paper analyzes the number of linear regions in deep versus shallow piecewise linear neural networks, providing geometric bounds and demonstrating the greater complexity of deep models.

Contribution

It introduces a geometric framework for comparing deep and shallow networks' complexity, deriving bounds on their linear regions, and highlighting the advantages of depth.

Findings

01

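For a fixed number of inputs, the lower bound on the number of linear regions of a deep rectifier network grows exponentially with depth, while the upper bound for a shallow network with the same number of hidden units grows only polynomially.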

02

The number of linear regions in deep models outpaces shallow models asymptotically.

03

Deep models with fixed input size can significantly surpass shallow models in complexity.

Abstract

This paper explores the complexity of deep feedforward networks with linear pre-synaptic couplings and rectified linear activations. This is a contribution to the growing body of work contrasting the representational power of deep and shallow network architectures. In particular, we offer a framework for comparing deep and shallow models that belong to the family of piecewise linear functions based on computational geometry. We look at a deep rectifier multi-layer perceptron (MLP) with linear output units and compare it with a single layer version of the model. In the asymptotic regime, when the number of inputs stays constant, if the shallow model has $k n$ hidden units and $n_{0}$ inputs, then the number of linear regions is $O(k^{n_{0}} n^{n_{0}})$. For a $k$ layer model with $n$ hidden units on each layer it is $\Omega(\lfloor n / n_{0} \rfloor^{k - 1} n^{n_{0}})$. The number…
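To make the two bounds in the abstract easier to compare, here is a minimal Python sketch that evaluates only their growth terms, $(k n)^{n_{0}}$ for the shallow model and $\lfloor n / n_{0} \rfloor^{k - 1} n^{n_{0}}$ for the deep model. The function names are hypothetical and not from the paper, and the hidden constants of the $O$ and $\Omega$ notation are dropped, so the values illustrate scaling with depth rather than actual region counts.

```python
def shallow_growth_term(n0: int, n: int, k: int) -> int:
    """Growth term (k*n)**n0 of the O(k^{n0} n^{n0}) bound for a shallow
    model with k*n hidden units and n0 inputs (constants omitted)."""
    return (k * n) ** n0


def deep_growth_term(n0: int, n: int, k: int) -> int:
    """Growth term floor(n/n0)**(k-1) * n**n0 of the Omega bound for a
    k-layer model with n hidden units per layer (constants omitted)."""
    return (n // n0) ** (k - 1) * n ** n0


def growth_ratio(n0: int, n: int, k: int) -> float:
    """Deep growth term divided by shallow growth term."""
    return deep_growth_term(n0, n, k) / shallow_growth_term(n0, n, k)


# Assumed illustrative setting: n0 = 2 inputs, n = 8 units per layer.
for k in (2, 4, 8):
    print(k, shallow_growth_term(2, 8, k), deep_growth_term(2, 8, k))
```

With these assumed values, the two terms coincide at $k = 2$ (256 each), while at $k = 4$ the shallow term is 1024 and the deep term is 4096. The ratio $\lfloor n / n_{0} \rfloor^{k - 1} / k^{n_{0}}$ grows exponentially in $k$ whenever $\lfloor n / n_{0} \rfloor \geq 2$.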

Taxonomy

Topics: Advanced Memory and Neural Computing · Neural dynamics and brain function · Neural Networks and Applications