Deep Reinforcement Learning for Multi-Resource Multi-Machine Job Scheduling
Weijia Chen, Yuedong Xu, Xiaofeng Wu

TL;DR
This paper applies deep reinforcement learning to optimize multi-resource, multi-machine job scheduling in data centers, demonstrating potential improvements over traditional algorithms in complex environments.
Contribution
It extends a recent deep reinforcement learning-based scheduling algorithm to multiple server clusters, showing enhanced performance in complex data center scenarios.
Findings
Deep reinforcement learning can outperform traditional scheduling algorithms.
The extended method effectively manages multi-resource, multi-machine environments.
Potential for improved efficiency in data center job scheduling.
Abstract
Minimizing job scheduling time is a fundamental issue in data center networks that has been extensively studied in recent years. The incoming jobs require different CPU and memory units, and span different number of time slots. The traditional solution is to design efficient heuristic algorithms with performance guarantee under certain assumptions. In this paper, we improve a recently proposed job scheduling algorithm using deep reinforcement learning and extend it to multiple server clusters. Our study reveals that deep reinforcement learning method has the potential to outperform traditional resource allocation algorithms in a variety of complicated environments.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCloud Computing and Resource Management · IoT and Edge/Fog Computing · Distributed and Parallel Computing Systems
