Policy-Based Reinforcement Learning with Action Masking for Dynamic Job Shop Scheduling under Uncertainty: Handling Random Arrivals and Machine Failures

Sofiene Lassoued; Stefan Lier; Andreas Schwung

arXiv:2601.09293·cs.AI·January 15, 2026

Policy-Based Reinforcement Learning with Action Masking for Dynamic Job Shop Scheduling under Uncertainty: Handling Random Arrivals and Machine Failures

Sofiene Lassoued, Stefan Lier, Andreas Schwung

PDF

Open Access

TL;DR

This paper introduces a reinforcement learning framework using action masking and Petri nets to effectively solve dynamic job shop scheduling problems under uncertainty, including random arrivals and machine failures.

Contribution

It combines model-based Petri nets with maskable RL policies to handle complex stochastic industrial scenarios, improving scheduling performance and robustness.

Findings

01

Outperforms traditional heuristics in makespan minimization

02

Effectively models stochastic job arrivals and machine failures

03

Demonstrates scalability and explainability in dynamic environments

Abstract

We present a novel framework for solving Dynamic Job Shop Scheduling Problems under uncertainty, addressing the challenges introduced by stochastic job arrivals and unexpected machine breakdowns. Our approach follows a model-based paradigm, using Coloured Timed Petri Nets to represent the scheduling environment, and Maskable Proximal Policy Optimization to enable dynamic decision-making while restricting the agent to feasible actions at each decision point. To simulate realistic industrial conditions, dynamic job arrivals are modeled using a Gamma distribution, which captures complex temporal patterns such as bursts, clustering, and fluctuating workloads. Machine failures are modeled using a Weibull distribution to represent age-dependent degradation and wear-out dynamics. These stochastic models enable the framework to reflect real-world manufacturing scenarios better. In addition, we…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsScheduling and Optimization Algorithms · Petri Nets in System Modeling · Advanced Queuing Theory Analysis