Achieving Zero Asymptotic Queueing Delay for Parallel Jobs
Wentao Weng, Weina Wang

TL;DR
This paper introduces a new model for parallel jobs in large-scale systems and demonstrates that zero queueing delay can be achieved with a smaller probe overhead using a batch-filling policy, under certain conditions.
Contribution
It proposes a novel notion of zero delay for parallel jobs and shows that it can be achieved with reduced probe overhead using a modified policy.
Findings
Zero delay is achievable with probe overhead d = ω(1/((1-λ) log k))
Zero delay cannot be achieved if d = exp(o(log N / log k))
The results apply in the sub-Halfin-Whitt heavy-traffic regime
Abstract
Zero queueing delay is highly desirable in large-scale computing systems. Existing work has shown that it can be asymptotically achieved by using the celebrated Power-of--choices (pod) policy with a probe overhead , and it is impossible when , where is the number of servers and is the load of the system. However, these results are based on the model where each job is an indivisible unit, which does not capture the parallel structure of jobs in today's predominant parallel computing paradigm. This paper thus considers a model where each job consists of a batch of parallel tasks. Under this model, we propose a new notion of zero (asymptotic) queueing delay that requires the job delay under a policy to approach the job delay given by the max of its tasks' service times, i.e., the job…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Queuing Theory Analysis · Age of Information Optimization · Cloud Computing and Resource Management
