Efficient End-to-End Learning for Decision-Making: A Meta-Optimization Approach
Rares Cristian, Pavithra Harsha, Georgia Perakis, Brian Quanz

TL;DR
This paper introduces a meta-optimization approach that trains neural networks to efficiently approximate complex optimization problems, significantly reducing computational costs in end-to-end decision-making models across various applications.
Contribution
The paper presents a novel neural network architecture with theoretical guarantees that accelerates optimization in end-to-end learning, applicable to diverse large-scale problems.
Findings
Achieves exponential convergence and approximation guarantees.
Provides faster, scalable solutions compared to existing methods.
Demonstrates effectiveness on real-world and synthetic optimization tasks.
Abstract
End-to-end learning has become a widely applicable and studied problem in training predictive ML models to be aware of their impact on downstream decision-making tasks. These end-to-end models often outperform traditional methods that separate training from the optimization and only myopically focus on prediction error. However, the computational complexity of end-to-end frameworks poses a significant challenge, particularly for large-scale problems. While training an ML model using gradient descent, each time we need to compute a gradient we must solve an expensive optimization problem. We present a meta-optimization method that learns efficient algorithms to approximate optimization problems, dramatically reducing computational overhead of solving the decision problem in general, an aspect we leverage in the training within the end-to-end framework. Our approach introduces a neural…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVehicle Routing Optimization Methods · Stochastic Gradient Optimization Techniques · Complexity and Algorithms in Graphs
MethodsAttentive Walk-Aggregating Graph Neural Network · Focus
