Policy-Based Reinforcement Learning with Action Masking for Dynamic Job Shop Scheduling under Uncertainty: Handling Random Arrivals and Machine Failures
Sofiene Lassoued, Stefan Lier, Andreas Schwung

TL;DR
This paper introduces a reinforcement learning framework using action masking and Petri nets to effectively solve dynamic job shop scheduling problems under uncertainty, including random arrivals and machine failures.
Contribution
It combines model-based Petri nets with maskable RL policies to handle complex stochastic industrial scenarios, improving scheduling performance and robustness.
Findings
Outperforms traditional heuristics in makespan minimization
Effectively models stochastic job arrivals and machine failures
Demonstrates scalability and explainability in dynamic environments
Abstract
We present a novel framework for solving Dynamic Job Shop Scheduling Problems under uncertainty, addressing the challenges introduced by stochastic job arrivals and unexpected machine breakdowns. Our approach follows a model-based paradigm, using Coloured Timed Petri Nets to represent the scheduling environment, and Maskable Proximal Policy Optimization to enable dynamic decision-making while restricting the agent to feasible actions at each decision point. To simulate realistic industrial conditions, dynamic job arrivals are modeled using a Gamma distribution, which captures complex temporal patterns such as bursts, clustering, and fluctuating workloads. Machine failures are modeled using a Weibull distribution to represent age-dependent degradation and wear-out dynamics. These stochastic models enable the framework to reflect real-world manufacturing scenarios better. In addition, we…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsScheduling and Optimization Algorithms · Petri Nets in System Modeling · Advanced Queuing Theory Analysis
