Better Neural Network Expressivity: Subdividing the Simplex
Egor Bakaev, Florestan Brunck, Christoph Hertrich, Jack Stade, Amir Yehudayoff

TL;DR
This paper demonstrates that ReLU neural networks can compute all continuous piecewise linear functions on ^n with fewer layers than previously thought, using novel geometric constructions and polyhedral subdivisions.
Contribution
It disproves a conjecture by showing fewer layers are needed for universal CPWL function representation, introducing new constructions for maximum functions with ReLU networks.
Findings
Fewer layers ( log_3(n)) suffice for universal CPWL representation.
ReLU networks with two hidden layers can exactly compute the maximum of five inputs.
The constructions nearly match known lower bounds, advancing understanding of neural network expressivity.
Abstract
This work studies the expressivity of ReLU neural networks with a focus on their depth. A sequence of previous works showed that hidden layers are sufficient to compute all continuous piecewise linear (CPWL) functions on . Hertrich, Basu, Di Summa, and Skutella (NeurIPS'21 / SIDMA'23) conjectured that this result is optimal in the sense that there are CPWL functions on , like the maximum function, that require this depth. We disprove the conjecture and show that hidden layers are sufficient to compute all CPWL functions on . A key step in the proof is that ReLU neural networks with two hidden layers can exactly represent the maximum function of five inputs. More generally, we show that hidden layers are sufficient to compute the maximum of numbers.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications
MethodsFocus · *Communicated@Fast*How Do I Communicate to Expedia?
