Synergy via Redundancy: Adaptive Replication Strategies and Fundamental Limits
Gauri Joshi, Dhruva Kaushal

TL;DR
This paper investigates the fundamental limits of throughput enhancement in multi-server systems through adaptive job replication, proposing policies and bounds that optimize redundancy strategies for maximum efficiency.
Contribution
It introduces adaptive replication policies including MaxRate and AdaRep, expanding beyond upfront strategies, and derives upper bounds to quantify the maximum achievable throughput boost.
Findings
Adaptive policies improve throughput over static methods.
Upper bounds establish fundamental limits on redundancy gains.
Proposed policies approach the theoretical maximum throughput.
Abstract
The maximum possible throughput (or the rate of job completion) of a multi-server system is typically the sum of the service rates of individual servers. Recent work shows that launching multiple replicas of a job and canceling them as soon as one copy finishes can boost the throughput, especially when the service time distribution has high variability. This means that redundancy can, in fact, create synergy among servers such that their overall throughput is greater than the sum of individual servers. This work seeks to find the fundamental limit of the throughput boost achieved by job replication and the optimal replication policy to achieve it. While most previous works consider upfront replication policies, we expand the set of possible policies to delayed launch of replicas. The search for the optimal adaptive replication policy can be formulated as a Markov Decision Process, using…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
