Policy-Based Deep Reinforcement Learning Hyperheuristics for Job-Shop Scheduling Problems

Sofiene Lassoued; Asrat Gobachew; Stefan Lier; Andreas Schwung

arXiv:2601.11189·cs.AI·January 19, 2026

Policy-Based Deep Reinforcement Learning Hyperheuristics for Job-Shop Scheduling Problems

Sofiene Lassoued, Asrat Gobachew, Stefan Lier, Andreas Schwung

PDF

Open Access

TL;DR

This paper introduces a novel policy-based deep reinforcement learning hyper-heuristic framework for Job Shop Scheduling, incorporating mechanisms for feasible action filtering and heuristic switching regulation, leading to improved scheduling performance.

Contribution

It presents a new hyper-heuristic approach with action prefiltering and commitment strategies, advancing the application of deep RL in JSSP.

Findings

01

Outperforms traditional heuristics and metaheuristics

02

Effective in reducing makespan on benchmark problems

03

Demonstrates advantages of stochastic over deterministic action selection

Abstract

This paper proposes a policy-based deep reinforcement learning hyper-heuristic framework for solving the Job Shop Scheduling Problem. The hyper-heuristic agent learns to switch scheduling rules based on the system state dynamically. We extend the hyper-heuristic framework with two key mechanisms. First, action prefiltering restricts decision-making to feasible low-level actions, enabling low-level heuristics to be evaluated independently of environmental constraints and providing an unbiased assessment. Second, a commitment mechanism regulates the frequency of heuristic switching. We investigate the impact of different commitment strategies, from step-wise switching to full-episode commitment, on both training behavior and makespan. Additionally, we compare two action selection strategies at the policy level: deterministic greedy selection and stochastic sampling. Computational…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsScheduling and Optimization Algorithms · Constraint Satisfaction and Optimization · Resource-Constrained Project Scheduling