Offline reinforcement learning for job-shop scheduling problems

Imanol Echeverria; Maialen Murua; Roberto Santana

arXiv:2410.15714·cs.LG·April 8, 2025

Offline reinforcement learning for job-shop scheduling problems

Imanol Echeverria, Maialen Murua, Roberto Santana

PDF

Open Access

TL;DR

This paper presents a novel offline reinforcement learning approach tailored for complex combinatorial optimization problems like job-shop scheduling, effectively handling heterogeneous graph states and variable actions to outperform existing methods.

Contribution

Introduces an offline RL method that encodes actions as edge attributes and balances reward optimization with imitation, addressing limitations of prior RL and behavioral cloning approaches.

Findings

01

Achieves superior performance on job-shop scheduling benchmarks.

02

Effectively handles complex constraints and variable actions.

03

Outperforms state-of-the-art techniques in experiments.

Abstract

Recent advances in deep learning have shown significant potential for solving combinatorial optimization problems in real-time. Unlike traditional methods, deep learning can generate high-quality solutions efficiently, which is crucial for applications like routing and scheduling. However, existing approaches like deep reinforcement learning (RL) and behavioral cloning have notable limitations, with deep RL suffering from slow learning and behavioral cloning relying solely on expert actions, which can lead to generalization issues and neglect of the optimization objective. This paper introduces a novel offline RL method designed for combinatorial optimization problems with complex constraints, where the state is represented as a heterogeneous graph and the action space is variable. Our approach encodes actions in edge attributes and balances expected rewards with the imitation of expert…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsScheduling and Optimization Algorithms · Assembly Line Balancing Optimization · Advanced Manufacturing and Logistics Optimization