schlably: A Python Framework for Deep Reinforcement Learning Based Scheduling Experiments
Constantin Waubert de Puiseau, Jannik Peters, Christian D\"orpelkus,, Hasan Tercan, Tobias Meisen

TL;DR
schlably is a Python framework designed to streamline and standardize deep reinforcement learning experiments for production scheduling, enhancing comparability and reusability of research in this domain.
Contribution
It introduces a comprehensive, flexible framework that reduces redundant effort and enables consistent experimentation in DRL-based production scheduling research.
Findings
Facilitates standardized DRL scheduling experiments
Increases research comparability and reusability
Reduces development overhead for scheduling solutions
Abstract
Research on deep reinforcement learning (DRL) based production scheduling (PS) has gained a lot of attention in recent years, primarily due to the high demand for optimizing scheduling problems in diverse industry settings. Numerous studies are carried out and published as stand-alone experiments that often vary only slightly with respect to problem setups and solution approaches. The programmatic core of these experiments is typically very similar. Despite this fact, no standardized and resilient framework for experimentation on PS problems with DRL algorithms could be established so far. In this paper, we introduce schlably, a Python-based framework that provides researchers a comprehensive toolset to facilitate the development of PS solution strategies based on DRL. schlably eliminates the redundant overhead work that the creation of a sturdy and flexible backbone requires and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsScheduling and Optimization Algorithms · Reinforcement Learning in Robotics · Robot Manipulation and Learning
MethodsQ-Learning · Deep Q-Network · Proximal Policy Optimization
