RepNet: Cutting Tail Latency in Data Center Networks with Flow Replication
Shuhao Liu, Wei Bai, Hong Xu, Kai Chen, Zhiping Cai

TL;DR
RepNet is an application-layer transport protocol that reduces tail latency in data center networks by using flow replication techniques, achieving over 50% latency reduction without requiring hardware modifications.
Contribution
It introduces RepNet, a deployable flow replication method at the application layer that significantly improves tail latency in data centers.
Findings
Reduces tail latency of mice flows by over 50%.
Effective in real network testbeds and Mininet simulations.
Operates efficiently on node.js platform.
Abstract
Data center networks need to provide low latency, especially at the tail, as demanded by many interactive applications. To improve tail latency, existing approaches require modifications to switch hardware and/or end-host operating systems, making them difficult to be deployed. We present the design, implementation, and evaluation of RepNet, an application layer transport that can be deployed today. RepNet exploits the fact that only a few paths among many are congested at any moment in the network, and applies simple flow replication to mice flows to opportunistically use the less congested path. RepNet has two designs for flow replication: (1) RepSYN, which only replicates SYN packets and uses the first connection that finishes TCP handshaking for data transmission, and (2) RepFlow which replicates the entire mice flow. We implement RepNet on {\tt node.js}, one of the most commonly…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCloud Computing and Resource Management · Software-Defined Networks and 5G · Advanced Data Storage Technologies
