Offline reinforcement learning for job-shop scheduling problems
Imanol Echeverria, Maialen Murua, Roberto Santana

TL;DR
This paper presents a novel offline reinforcement learning approach tailored for complex combinatorial optimization problems like job-shop scheduling, effectively handling heterogeneous graph states and variable actions to outperform existing methods.
Contribution
Introduces an offline RL method that encodes actions as edge attributes and balances reward optimization with imitation, addressing limitations of prior RL and behavioral cloning approaches.
Findings
Achieves superior performance on job-shop scheduling benchmarks.
Effectively handles complex constraints and variable actions.
Outperforms state-of-the-art techniques in experiments.
Abstract
Recent advances in deep learning have shown significant potential for solving combinatorial optimization problems in real-time. Unlike traditional methods, deep learning can generate high-quality solutions efficiently, which is crucial for applications like routing and scheduling. However, existing approaches like deep reinforcement learning (RL) and behavioral cloning have notable limitations, with deep RL suffering from slow learning and behavioral cloning relying solely on expert actions, which can lead to generalization issues and neglect of the optimization objective. This paper introduces a novel offline RL method designed for combinatorial optimization problems with complex constraints, where the state is represented as a heterogeneous graph and the action space is variable. Our approach encodes actions in edge attributes and balances expected rewards with the imitation of expert…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsScheduling and Optimization Algorithms · Assembly Line Balancing Optimization · Advanced Manufacturing and Logistics Optimization
