Pruning for efficient deterministic global optimization over trained ReLU neural networks
Giacomo Lastrucci, Tanuj Karia, Victor Schulte, Dominik Bongartz, Artur M. Schweidtmann

TL;DR
This paper demonstrates that pruning neural networks significantly accelerates embedded optimization problems by reducing problem size and tightening bounds, enabling faster solutions in complex engineering applications.
Contribution
It provides a theoretical and empirical analysis of how weight and node pruning improve the efficiency of ReLU neural network-based optimization models.
Findings
Pruning achieves speedups of up to 10^3-10^4 times.
Pruning reduces problem size and number of integer variables.
Pruning tightens big-M bounds, improving solver performance.
Abstract
Neural networks are increasingly used as surrogates in optimization problems to replace computationally expensive models. However, embedding ReLU neural networks in mathematical programs introduces significant computational challenges, particularly for deep and wide networks, due to both the formulation of the ReLU disjunction and the resulting large-scale optimization problem. This work investigates how pruning techniques can accelerate the solution of optimization problems with embedded neural networks, focusing on the mechanisms underlying the computational gains. We provide theoretical insights into how both unstructured (weight) and structured (node) pruning affect the ReLU big-M formulation, showing that pruning monotonically tightens preactivation bounds. We conduct comprehensive empirical studies across multiple network architectures using an illustrative test function and a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
