Towards Standardising Reinforcement Learning Approaches for Production   Scheduling Problems

Alexandru Rinciog; Anne Meyer

arXiv:2104.08196·cs.LG·February 16, 2023

Towards Standardising Reinforcement Learning Approaches for Production Scheduling Problems

Alexandru Rinciog, Anne Meyer

PDF

Open Access 1 Repo

TL;DR

This paper advocates for standardising reinforcement learning methods and validation procedures in production scheduling to enhance reproducibility, comparability, and industry applicability.

Contribution

It introduces standardized descriptions for production setups, classifies RL design choices, and recommends validation schemes for reproducibility and benchmarking.

Findings

01

Standardized production setup descriptions based on established nomenclature.

02

Classification of RL design choices from existing literature.

03

Recommendations for validation schemes emphasizing reproducibility.

Abstract

Recent years have seen a rise in interest in terms of using machine learning, particularly reinforcement learning (RL), for production scheduling problems of varying degrees of complexity. The general approach is to break down the scheduling problem into a Markov Decision Process (MDP), whereupon a simulation implementing the MDP is used to train an RL agent. Since existing studies rely on (sometimes) complex simulations for which the code is unavailable, the experiments presented are hard, or, in the case of stochastic environments, impossible to reproduce accurately. Furthermore, there is a vast array of RL designs to choose from. To make RL methods widely applicable in production scheduling and work out their strength for the industry, the standardisation of model descriptions - both production setup and RL design - and validation scheme are a prerequisite. Our contribution is…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

malerinc/fabricatio-rl
tf

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsScheduling and Optimization Algorithms · Reinforcement Learning in Robotics · Assembly Line Balancing Optimization