Optimizing Stochastic Scheduling in Fork-Join Queueing Models: Bounds and Applications
Wasiur R. KhudaBukhsh, Amr Rizk, Alexander Fr\"ommgen, Heinz, Koeppl

TL;DR
This paper develops stochastic bounds and optimal scheduling strategies for heterogeneous Fork-Join queueing systems, balancing parallelization benefits and synchronization penalties to improve system performance in applications like MapReduce and RAID.
Contribution
It introduces a generalized probabilistic framework for server scheduling in FJ systems, providing bounds and optimal strategies tailored to different parallelization regimes.
Findings
Derived computable bounds for waiting and response times.
Identified trade-offs between parallelization benefits and synchronization penalties.
Proposed application-specific optimal scheduling strategies.
Abstract
Fork-Join (FJ) queueing models capture the dynamics of system parallelization under synchronization constraints, for example, for applications such as MapReduce, multipath transmission and RAID systems. Arriving jobs are first split into tasks and mapped to servers for execution, such that a job can only leave the system when all of its tasks are executed. In this paper, we provide computable stochastic bounds for the waiting and response time distributions for heterogeneous FJ systems under general parallelization benefit. Our main contribution is a generalized mathematical framework for probabilistic server scheduling strategies that are essentially characterized by a probability distribution over the number of utilized servers, and the optimization thereof. We highlight the trade-off between the scaling benefit due to parallelization and the FJ inherent synchronization penalty.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCloud Computing and Resource Management · Advanced Queuing Theory Analysis · Distributed systems and fault tolerance
