Improving Multiresource Job Scheduling with Markovian Service Rate   Policies

Zhongrui Chen; Isaac Grosof; Benjamin Berg

arXiv:2412.08915·cs.PF·April 18, 2025

Improving Multiresource Job Scheduling with Markovian Service Rate Policies

Zhongrui Chen, Isaac Grosof, Benjamin Berg

PDF

TL;DR

This paper introduces Markovian Service Rate (MSR) policies for multiresource job scheduling in cloud systems, providing a simple, analyzable, and throughput-optimal approach that minimizes mean response time.

Contribution

The paper proposes a new class of MSR scheduling policies that are simple to implement, throughput-optimal, and come with tight bounds on mean response time for multiresource jobs.

Findings

01

MSR policies are throughput-optimal for multiresource job scheduling.

02

Derived bounds on mean response time are tight up to an additive constant.

03

Applicable to systems with various preemption behaviors.

Abstract

Modern cloud computing workloads are composed of multiresource jobs that require a variety of computational resources in order to run, such as CPU cores, memory, disk space, or hardware accelerators. A single cloud server can typically run many multiresource jobs in parallel, but only if the server has sufficient resources to satisfy the demands of every job. A scheduling policy must therefore select sets of multiresource jobs to run in parallel in order to minimize the mean response time across jobs -- the average time from when a job arrives to the system until it is completed. Unfortunately, achieving low response times by selecting sets of jobs that fully utilize the available server resources has proven to be a difficult problem. In this paper, we develop and analyze a new class of policies for scheduling multiresource jobs, called Markovian Service Rate (MSR) policies. While…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.