OptNet: Differentiable Optimization as a Layer in Neural Networks
Brandon Amos, J. Zico Kolter

TL;DR
OptNet introduces a novel neural network layer that incorporates quadratic programming optimization problems, enabling the encoding of complex constraints and dependencies within end-to-end trainable models, demonstrated through learning to play mini-Sudoku.
Contribution
This work develops a differentiable optimization layer using quadratic programs, with efficient GPU-based solvers and applications in constraint learning and game playing.
Findings
Successfully differentiates through quadratic programming layers
Efficient GPU-based solver with minimal additional cost
Learned to play mini-Sudoku without prior rule knowledge
Abstract
This paper presents OptNet, a network architecture that integrates optimization problems (here, specifically in the form of quadratic programs) as individual layers in larger end-to-end trainable deep networks. These layers encode constraints and complex dependencies between the hidden states that traditional convolutional and fully-connected layers often cannot capture. We explore the foundations for such an architecture: we show how techniques from sensitivity analysis, bilevel optimization, and implicit differentiation can be used to exactly differentiate through these layers and with respect to layer parameters; we develop a highly efficient solver for these layers that exploits fast GPU-based batch solves within a primal-dual interior point method, and which provides backpropagation gradients with virtually no additional cost on top of the solve; and we highlight the application of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Optimization Algorithms Research · Model Reduction and Neural Networks · Advanced Numerical Analysis Techniques
