On minimal representations of shallow ReLU networks
S. Dereich, S. Kassing

TL;DR
This paper characterizes the minimal number of neurons needed for shallow ReLU networks to represent continuous piecewise affine functions, revealing differences between one-dimensional and higher-dimensional cases.
Contribution
It provides a precise characterization of minimal network sizes, describes the structure of minimal networks as a smooth manifold, and offers criteria for hyperplanes to realize all such functions.
Findings
Minimal representations use n, n+1, or n+2 neurons.
In 1D, at most n+1 neurons are needed; higher dimensions may require n+2.
The set of minimal networks forms a smooth manifold with known dimension.
Abstract
The realization function of a shallow ReLU network is a continuous and piecewise affine function , where the domain is partitioned by a set of hyperplanes into cells on which is affine. We show that the minimal representation for uses either , or neurons and we characterize each of the three cases. In the particular case, where the input layer is one-dimensional, minimal representations always use at most neurons but in all higher dimensional settings there are functions for which neurons are needed. Then we show that the set of minimal networks representing forms a -submanifold and we derive the dimension and the number of connected components of . Additionally, we give a criterion for the hyperplanes that guarantees that all continuous, piecewise affine functions are realization…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications · Advanced Memory and Neural Computing · Neural dynamics and brain function
