When Do Redundant Requests Reduce Latency ?
Nihar B. Shah, Kangwook Lee, Kannan Ramchandran

TL;DR
This paper provides an analytical framework to determine when redundant requests in distributed systems reduce latency, revealing that their effectiveness depends on service time distributions and removal policies, with optimal strategies varying by load conditions.
Contribution
It introduces a model analyzing the impact of redundancy on latency, characterizing scenarios where redundancy helps or hurts, and guiding optimal redundant-request policies.
Findings
Redundant requests reduce latency with memoryless or heavier service times when jobs can be instantly removed.
Under high load, no redundancy is optimal when service times are lighter or removal is delayed.
Results apply to arbitrary arrival processes, broadening their applicability.
Abstract
Several systems possess the flexibility to serve requests in more than one way. For instance, a distributed storage system storing multiple replicas of the data can serve a request from any of the multiple servers that store the requested data, or a computational task may be performed in a compute-cluster by any one of multiple processors. In such systems, the latency of serving the requests may potentially be reduced by sending "redundant requests": a request may be sent to more servers than needed, and it is deemed served when the requisite number of servers complete service. Such a mechanism trades off the possibility of faster execution of at least one copy of the request with the increase in the delay due to an increased load on the system. Due to this tradeoff, it is unclear when redundant requests may actually help. Several recent works empirically evaluate the latency…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDistributed systems and fault tolerance · Cloud Computing and Resource Management · Advanced Data Storage Technologies
