Deep reinforcement learning for machine scheduling: Methodology, the   state-of-the-art, and future directions

Maziyar Khadivi; Todd Charter; Marjan Yaghoubi; Masoud Jalayer; Maryam; Ahang; Ardeshir Shojaeinasab; Homayoun Najjaran

arXiv:2310.03195·cs.LG·January 30, 2025

Deep reinforcement learning for machine scheduling: Methodology, the state-of-the-art, and future directions

Maziyar Khadivi, Todd Charter, Marjan Yaghoubi, Masoud Jalayer, Maryam, Ahang, Ardeshir Shojaeinasab, Homayoun Najjaran

PDF

Open Access

TL;DR

This paper reviews deep reinforcement learning methods for machine scheduling, comparing their methodologies and highlighting their advantages, limitations, and future research directions in optimizing complex manufacturing processes.

Contribution

It provides a comprehensive categorization and comparison of DRL-based scheduling approaches, emphasizing their performance and identifying key challenges for future work.

Findings

01

DRL methods outperform traditional solvers and heuristics in speed and near-optimal solutions

02

DRL approaches are effective for static and dynamic scheduling in various environments

03

Limitations include handling complex constraints and ensuring scalability and robustness

Abstract

Machine scheduling aims to optimize job assignments to machines while adhering to manufacturing rules and job specifications. This optimization leads to reduced operational costs, improved customer demand fulfillment, and enhanced production efficiency. However, machine scheduling remains a challenging combinatorial problem due to its NP-hard nature. Deep Reinforcement Learning (DRL), a key component of artificial general intelligence, has shown promise in various domains like gaming and robotics. Researchers have explored applying DRL to machine scheduling problems since 1995. This paper offers a comprehensive review and comparison of DRL-based approaches, highlighting their methodology, applications, advantages, and limitations. It categorizes these approaches based on computational components: conventional neural networks, encoder-decoder architectures, graph neural networks, and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsScheduling and Optimization Algorithms · Metaheuristic Optimization Algorithms Research · Reinforcement Learning in Robotics

MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings · Focus