On Optimal Server Allocation for Moldable Jobs with Concave Speed-Up
Samira Ghanbarian, Arpan Mukhopadhyay, Ravi R. Mazumdar, Fabrice M., Guillemin

TL;DR
This paper proposes an optimal server allocation scheme for moldable jobs with concave speed-up functions, minimizing average execution time and ensuring low blocking probability in large systems.
Contribution
It introduces a simple, asymptotically optimal server allocation policy that minimizes average job execution time and reduces blocking probability in large-scale heterogeneous systems.
Findings
Achieves minimum average execution time for accepted jobs.
Blocking probability vanishes as system size grows.
Performance is robust across different job time distributions.
Abstract
A large proportion of jobs submitted to modern computing clusters and data centers are parallelizable and capable of running on a flexible number of computing cores or servers. Although allocating more servers to such a job results in a higher speed-up in the job's execution, it reduces the number of servers available to other jobs, which in the worst case, can result in an incoming job not finding any available server to run immediately upon arrival. Hence, a key question to address is: how to optimally allocate servers to jobs such that (i) the average execution time across jobs is minimized and (ii) almost all jobs find at least one server immediately upon arrival. To address this question, we consider a system with servers, where jobs are parallelizable up to servers and the speed-up function of jobs is concave and increasing. Jobs not finding any available servers…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsScheduling and Optimization Algorithms · Optimization and Search Problems · Advanced Queuing Theory Analysis
