heSRPT: Parallel Scheduling to Minimize Mean Slowdown
Benjamin Berg, Rein Vesilo, Mor Harchol-Balter

TL;DR
This paper introduces heSRPT, an optimal parallel server allocation policy for minimizing mean slowdown of parallelizable jobs, balancing efficiency and job prioritization, with proven optimality in static and effective heuristics online.
Contribution
It provides the first closed-form optimal allocation policy for minimizing mean slowdown of parallelizable jobs, balancing efficiency and prioritization, and extends to online job arrivals.
Findings
heSRPT is optimal for static job sets.
heSRPT outperforms existing policies in simulations.
The policy balances efficiency and job prioritization.
Abstract
Modern data centers serve workloads which are capable of exploiting parallelism. When a job parallelizes across multiple servers it will complete more quickly, but jobs receive diminishing returns from being allocated additional servers. Because allocating multiple servers to a single job is inefficient, it is unclear how best to allocate a fixed number of servers between many parallelizable jobs. This paper provides the first optimal allocation policy for minimizing the mean slowdown of parallelizable jobs of known size when all jobs are present at time 0. Our policy provides a simple closed form formula for the optimal allocations at every moment in time. Minimizing mean slowdown usually requires favoring short jobs over long ones (as in the SRPT policy). However, because parallelizable jobs have sublinear speedup functions, system efficiency is also an issue. System efficiency is…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
