Bounding and Counting Linear Regions of Deep Neural Networks
Thiago Serra, Christian Tjandraatmadja, Srikumar Ramalingam

TL;DR
This paper analyzes the complexity of deep neural networks by studying the number of linear regions they can produce, providing new bounds and methods for counting these regions, especially in rectifier and maxout networks.
Contribution
It introduces tighter bounds for the number of linear regions in rectifier networks and a novel mixed-integer linear method for exact enumeration.
Findings
Deep rectifier networks can surpass shallow ones in linear regions only if neuron count exceeds input dimension.
New bounds are exact for one-dimensional inputs.
A mixed-integer linear formulation enables precise counting of linear regions.
Abstract
We investigate the complexity of deep neural networks (DNN) that represent piecewise linear (PWL) functions. In particular, we study the number of linear regions, i.e. pieces, that a PWL function represented by a DNN can attain, both theoretically and empirically. We present (i) tighter upper and lower bounds for the maximum number of linear regions on rectifier networks, which are exact for inputs of dimension one; (ii) a first upper bound for multi-layer maxout networks; and (iii) a first method to perform exact enumeration or counting of the number of regions by modeling the DNN with a mixed-integer linear formulation. These bounds come from leveraging the dimension of the space defining each linear region. The results also indicate that a deep rectifier network can only have more linear regions than every shallow counterpart with same number of neurons if that number exceeds the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Machine Learning and Algorithms · Adversarial Robustness in Machine Learning
