Exploring Multi-Agent Reinforcement Learning for Unrelated Parallel Machine Scheduling
Maria Zampella, Urtzi Otamendi, Xabier Belaunzaran, Arkaitz Artetxe,, Igor G. Olaizola, Giuseppe Longo, Basilio Sierra

TL;DR
This paper explores the application of Multi-Agent Reinforcement Learning to the complex Unrelated Parallel Machine Scheduling Problem, demonstrating the effectiveness of MARL approaches and highlighting challenges in cooperative learning.
Contribution
It introduces a MARL framework for UPMS with setup times, compares single- and multi-agent algorithms, and provides empirical insights into their performance and scalability.
Findings
MARL approaches outperform single-agent algorithms in scalable scenarios
Maskable PPO enhances single-agent scheduling performance
Multi-Agent PPO shows potential but faces cooperative learning challenges
Abstract
Scheduling problems pose significant challenges in resource, industry, and operational management. This paper addresses the Unrelated Parallel Machine Scheduling Problem (UPMS) with setup times and resources using a Multi-Agent Reinforcement Learning (MARL) approach. The study introduces the Reinforcement Learning environment and conducts empirical analyses, comparing MARL with Single-Agent algorithms. The experiments employ various deep neural network policies for single- and Multi-Agent approaches. Results demonstrate the efficacy of the Maskable extension of the Proximal Policy Optimization (PPO) algorithm in Single-Agent scenarios and the Multi-Agent PPO algorithm in Multi-Agent setups. While Single-Agent algorithms perform adequately in reduced scenarios, Multi-Agent approaches reveal challenges in cooperative learning but a scalable capacity. This research contributes insights…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsScheduling and Optimization Algorithms · Assembly Line Balancing Optimization · Advanced Manufacturing and Logistics Optimization
MethodsEntropy Regularization · Proximal Policy Optimization
