Deep Reinforcement Learning for Multi-Resource Multi-Machine Job   Scheduling

Weijia Chen; Yuedong Xu; Xiaofeng Wu

arXiv:1711.07440·cs.DC·November 21, 2017·19 cites

Deep Reinforcement Learning for Multi-Resource Multi-Machine Job Scheduling

Weijia Chen, Yuedong Xu, Xiaofeng Wu

PDF

Open Access

TL;DR

This paper applies deep reinforcement learning to optimize multi-resource, multi-machine job scheduling in data centers, demonstrating potential improvements over traditional algorithms in complex environments.

Contribution

It extends a recent deep reinforcement learning-based scheduling algorithm to multiple server clusters, showing enhanced performance in complex data center scenarios.

Findings

01

Deep reinforcement learning can outperform traditional scheduling algorithms.

02

The extended method effectively manages multi-resource, multi-machine environments.

03

Potential for improved efficiency in data center job scheduling.

Abstract

Minimizing job scheduling time is a fundamental issue in data center networks that has been extensively studied in recent years. The incoming jobs require different CPU and memory units, and span different number of time slots. The traditional solution is to design efficient heuristic algorithms with performance guarantee under certain assumptions. In this paper, we improve a recently proposed job scheduling algorithm using deep reinforcement learning and extend it to multiple server clusters. Our study reveals that deep reinforcement learning method has the potential to outperform traditional resource allocation algorithms in a variety of complicated environments.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsCloud Computing and Resource Management · IoT and Edge/Fog Computing · Distributed and Parallel Computing Systems