Differentiated latency in data center networks with erasure coded files through traffic engineering
Yu Xiang, Vaneet Aggarwal, Yih-Farn R. Chen, and Tian Lan

TL;DR
This paper introduces an algorithm for optimizing latency in data center networks with erasure-coded files, balancing bandwidth, request scheduling, and data placement to reduce latency for different service classes.
Contribution
It formulates a joint latency optimization problem for erasure-coded storage in data centers and proposes an efficient iterative algorithm to solve it, validated through real-world experiments.
Findings
Significant latency reduction achieved in experiments.
Validated theoretical latency bounds with practical tests.
Insights into designing low-latency data center networks.
Abstract
This paper proposes an algorithm to minimize weighted service latency for different classes of tenants (or service classes) in a data center network where erasure-coded files are stored on distributed disks/racks and access requests are scattered across the network. Due to limited bandwidth available at both top-of-the-rack and aggregation switches and tenants in different service classes need differentiated services, network bandwidth must be apportioned among different intra- and inter-rack data flows for different service classes in line with their traffic statistics. We formulate this problem as weighted queuing and employ a class of probabilistic request scheduling policies to derive a closed-form upper-bound of service latency for erasure-coded storage with arbitrary file access patterns and service time distributions. The result enables us to propose a joint weighted latency…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
