Achieving Zero Asymptotic Queueing Delay for Parallel Jobs

Wentao Weng; Weina Wang

arXiv:2004.02081·cs.PF·November 3, 2020·1 cites

Achieving Zero Asymptotic Queueing Delay for Parallel Jobs

Wentao Weng, Weina Wang

PDF

Open Access

TL;DR

This paper introduces a new model for parallel jobs in large-scale systems and demonstrates that zero queueing delay can be achieved with a smaller probe overhead using a batch-filling policy, under certain conditions.

Contribution

It proposes a novel notion of zero delay for parallel jobs and shows that it can be achieved with reduced probe overhead using a modified policy.

Findings

01

Zero delay is achievable with probe overhead d = ω(1/((1-λ) log k))

02

Zero delay cannot be achieved if d = exp(o(log N / log k))

03

The results apply in the sub-Halfin-Whitt heavy-traffic regime

Abstract

Zero queueing delay is highly desirable in large-scale computing systems. Existing work has shown that it can be asymptotically achieved by using the celebrated Power-of- $d$ -choices (pod) policy with a probe overhead $d = ω (\frac{l o g N}{1 - λ})$ , and it is impossible when $d = O (\frac{1}{1 - λ})$ , where $N$ is the number of servers and $λ$ is the load of the system. However, these results are based on the model where each job is an indivisible unit, which does not capture the parallel structure of jobs in today's predominant parallel computing paradigm. This paper thus considers a model where each job consists of a batch of parallel tasks. Under this model, we propose a new notion of zero (asymptotic) queueing delay that requires the job delay under a policy to approach the job delay given by the max of its tasks' service times, i.e., the job…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Queuing Theory Analysis · Age of Information Optimization · Cloud Computing and Resource Management