Neural Networks and (Virtual) Extended Formulations

Christoph Hertrich; Georg Loho

arXiv:2411.03006·math.CO·September 23, 2025

Neural Networks and (Virtual) Extended Formulations

Christoph Hertrich, Georg Loho

PDF

Open Access

TL;DR

This paper establishes lower bounds on the size of certain neural networks by linking their capabilities to extension complexity of polytopes, introducing the novel concept of virtual extension complexity for more general neural network bounds.

Contribution

It connects neural network complexity to polyhedral extension complexity and introduces virtual extension complexity as a new tool for analyzing neural network representations.

Findings

01

Exponential lower bounds for monotone and input-convex neural networks solving linear problems.

02

Introduction of virtual extension complexity as a generalization of extension complexity.

03

Demonstration that small virtual extended formulations enable efficient optimization over polytopes.

Abstract

Neural networks with piecewise linear activation functions, such as rectified linear units (ReLU) or maxout, are among the most fundamental models in modern machine learning. We make a step towards proving lower bounds on the size of such neural networks by linking their representative capabilities to the notion of the extension complexity $xc (P)$ of a polytope $P$ . This is a well-studied quantity in combinatorial optimization and polyhedral geometry describing the number of inequalities needed to model $P$ as a linear program. We show that $xc (P)$ is a lower bound on the size of any monotone or input-convex neural network that solves the linear optimization problem over $P$ . This implies exponential lower bounds on such neural networks for a variety of problems, including the polynomially solvable maximum weight matching problem. In an attempt to prove similar…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications