Learning Optimal Solutions via an LSTM-Optimization Framework
Dogacan Yilmaz, \.I. Esra B\"uy\"uktahtak{\i}n

TL;DR
This paper introduces an LSTM-based deep learning framework that predicts optimal solutions for dynamic mixed-integer programs, significantly reducing solution times while maintaining high solution quality.
Contribution
It develops a bidirectional LSTM framework for sequential decision problems, demonstrating substantial speed-ups in solving the capacitated lot-sizing problem with minimal loss in optimality.
Findings
Reduces solution time by a factor of 9 on average for benchmark problems.
Achieves less than 0.05% optimality gap with high feasibility.
Outperforms classical ML and exact algorithms in speed and accuracy.
Abstract
In this study, we present a deep learning-optimization framework to tackle dynamic mixed-integer programs. Specifically, we develop a bidirectional Long Short Term Memory (LSTM) framework that can process information forward and backward in time to learn optimal solutions to sequential decision-making problems. We demonstrate our approach in predicting the optimal decisions for the single-item capacitated lot-sizing problem (CLSP), where a binary variable denotes whether to produce in a period or not. Due to the dynamic nature of the problem, the CLSP can be treated as a sequence labeling task where a recurrent neural network can capture the problem's temporal dynamics. Computational results show that our LSTM-Optimization (LSTM-Opt) framework significantly reduces the solution time of benchmark CLSP problems without much loss in feasibility and optimality. For example, the predictions…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsScheduling and Optimization Algorithms · Advanced Multi-Objective Optimization Algorithms · Optimization and Packing Problems
MethodsTest · Tanh Activation · Logistic Regression · Sigmoid Activation · Long Short-Term Memory
