Deep reinforcement learning for machine scheduling: Methodology, the state-of-the-art, and future directions
Maziyar Khadivi, Todd Charter, Marjan Yaghoubi, Masoud Jalayer, Maryam, Ahang, Ardeshir Shojaeinasab, Homayoun Najjaran

TL;DR
This paper reviews deep reinforcement learning methods for machine scheduling, comparing their methodologies and highlighting their advantages, limitations, and future research directions in optimizing complex manufacturing processes.
Contribution
It provides a comprehensive categorization and comparison of DRL-based scheduling approaches, emphasizing their performance and identifying key challenges for future work.
Findings
DRL methods outperform traditional solvers and heuristics in speed and near-optimal solutions
DRL approaches are effective for static and dynamic scheduling in various environments
Limitations include handling complex constraints and ensuring scalability and robustness
Abstract
Machine scheduling aims to optimize job assignments to machines while adhering to manufacturing rules and job specifications. This optimization leads to reduced operational costs, improved customer demand fulfillment, and enhanced production efficiency. However, machine scheduling remains a challenging combinatorial problem due to its NP-hard nature. Deep Reinforcement Learning (DRL), a key component of artificial general intelligence, has shown promise in various domains like gaming and robotics. Researchers have explored applying DRL to machine scheduling problems since 1995. This paper offers a comprehensive review and comparison of DRL-based approaches, highlighting their methodology, applications, advantages, and limitations. It categorizes these approaches based on computational components: conventional neural networks, encoder-decoder architectures, graph neural networks, and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsScheduling and Optimization Algorithms · Metaheuristic Optimization Algorithms Research · Reinforcement Learning in Robotics
MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings · Focus
