Latency Versus Survivability in Geo-Distributed Data Center Design
Rodrigo de Souza Couto, Stefano Secci, Miguel Elias Mitre Campista,, and Lu\'is Henrique Maciel Kosmalski Costa

TL;DR
This paper investigates the trade-off between latency and survivability in geo-distributed data centers, showing that moderate survivability requirements cause minimal latency increases, thus informing optimal design choices.
Contribution
It formulates an optimization model to analyze latency-survivability trade-offs and provides simulation results demonstrating the impact of survivability levels on latency.
Findings
Latency increase is negligible for moderate survivability requirements.
Worst-case latency remains below 4 ms with 80% server availability after failures.
Significant latency increases occur only under very high survivability demands.
Abstract
A hot topic in data center design is to envision geo-distributed architectures spanning a few sites across wide area networks, allowing more proximity to the end users and higher survivability, defined as the capacity of a system to operate after failures. As a shortcoming, this approach is subject to an increase of latency between servers, caused by their geographic distances. In this paper, we address the trade-off between latency and survivability in geo-distributed data centers, through the formulation of an optimization problem. Simulations considering realistic scenarios show that the latency increase is significant only in the case of very strong survivability requirements, whereas it is negligible for moderate survivability requirements. For instance, the worst-case latency is less than 4~ms when guaranteeing that 80% of the servers are available after a failure, in a network…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
