A Reinforcement Learning Environment For Job-Shop Scheduling

Pierre Tassel; Martin Gebser; Konstantin Schekotihin

arXiv:2104.03760·cs.LG·April 9, 2021·46 cites

A Reinforcement Learning Environment For Job-Shop Scheduling

Pierre Tassel, Martin Gebser, Konstantin Schekotihin

PDF

Open Access 4 Repos

TL;DR

This paper introduces a new Deep Reinforcement Learning environment for job-shop scheduling, featuring a compact state representation and a reward function, achieving near state-of-the-art results on benchmark instances.

Contribution

The paper presents a novel DRL environment with a meaningful state representation and reward function tailored for job-shop scheduling, improving performance over existing DRL methods.

Findings

01

Significantly outperforms existing DRL methods on benchmark instances

02

Achieves results close to state-of-the-art combinatorial optimization approaches

03

Provides an efficient environment for applying DRL to job-shop scheduling

Abstract

Scheduling is a fundamental task occurring in various automated systems applications, e.g., optimal schedules for machines on a job shop allow for a reduction of production costs and waste. Nevertheless, finding such schedules is often intractable and cannot be achieved by Combinatorial Optimization Problem (COP) methods within a given time limit. Recent advances of Deep Reinforcement Learning (DRL) in learning complex behavior enable new COP application possibilities. This paper presents an efficient DRL environment for Job-Shop Scheduling -- an important problem in the field. Furthermore, we design a meaningful and compact state representation as well as a novel, simple dense reward function, closely related to the sparse make-span minimization criteria used by COP methods. We demonstrate that our approach significantly outperforms existing DRL methods on classic benchmark instances,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsScheduling and Optimization Algorithms · Reinforcement Learning in Robotics · Optimization and Search Problems

MethodsEntropy Regularization · Proximal Policy Optimization