A Combinatorial Perspective on the Optimization of Shallow ReLU Networks
Michael Matena, Colin Raffel

TL;DR
This paper models shallow ReLU network optimization as a combinatorial problem using zonotopes, introducing methods for vertex selection and heuristics that improve training efficiency and solution quality.
Contribution
It introduces a geometric and combinatorial framework for ReLU optimization using zonotopes, along with novel polynomial-time vertex selection and greedy heuristics.
Findings
Zonotope vertex set models feasible activation patterns.
Overparameterization improves likelihood of good solutions.
Proposed methods outperform gradient descent in certain settings.
Abstract
The NP-hard problem of optimizing a shallow ReLU network can be characterized as a combinatorial search over each training example's activation pattern followed by a constrained convex problem given a fixed set of activation patterns. We explore the implications of this combinatorial aspect of ReLU optimization in this work. We show that it can be naturally modeled via a geometric and combinatoric object known as a zonotope with its vertex set isomorphic to the set of feasible activation patterns. This assists in analysis and provides a foundation for further research. We demonstrate its usefulness when we explore the sensitivity of the optimal loss to perturbations of the training data. Later we discuss methods of zonotope vertex selection and its relevance to optimization. Overparameterization assists in training by making a randomly chosen vertex more likely to contain a good…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsInfrared Target Detection Methodologies · Advanced Image and Video Retrieval Techniques · Face and Expression Recognition
