Towards Optimality in Parallel Scheduling
Benjamin Berg, Jan-Pieter Dorsman, Mor Harchol-Balter

TL;DR
This paper develops an analytical model for scheduling jobs on multi-core systems, proving the optimality of evenly dividing cores under certain conditions and proposing practical fixed-level policies for diverse workloads.
Contribution
It introduces the EQUI policy's optimality for single speedup curves and derives the best fixed parallelization level, k*, for improved scheduling without job level changes.
Findings
EQUI is optimal for jobs with identical speedup curves and exponential sizes.
A fixed parallelization level k* can match EQUI's performance without dynamic adjustments.
GREEDY* performs near-optimally for heterogeneous speedup curves.
Abstract
To keep pace with Moore's law, chip designers have focused on increasing the number of cores per chip rather than single core performance. In turn, modern jobs are often designed to run on any number of cores. However, to effectively leverage these multi-core chips, one must address the question of how many cores to assign to each job. Given that jobs receive sublinear speedups from additional cores, there is an obvious tradeoff: allocating more cores to an individual job reduces the job's runtime, but in turn decreases the efficiency of the overall system. We ask how the system should schedule jobs across cores so as to minimize the mean response time over a stream of incoming jobs. To answer this question, we develop an analytical model of jobs running on a multi-core machine. We prove that EQUI, a policy which continuously divides cores evenly across jobs, is optimal when all jobs…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsParallel Computing and Optimization Techniques · Distributed and Parallel Computing Systems · Cloud Computing and Resource Management
