Terra: Scalable Cross-Layer GDA Optimizations
Jie You, Mosharaf Chowdhury

TL;DR
Terra introduces scalable cross-layer optimizations for geo-distributed analytics, significantly reducing WAN transfer times and improving deadline adherence by dynamically reacting to network fluctuations.
Contribution
It presents a novel scalable algorithm and overlay enforcement mechanism for joint WAN routing and scheduling tailored for GDA frameworks.
Findings
Improves GDA job completion times by up to 3.43x.
Enables GDA jobs to meet 2.82x-4.29x more deadlines.
Quickly reacts to WAN fluctuations and failures.
Abstract
Geo-distributed analytics (GDA) frameworks transfer large datasets over the wide-area network (WAN). Yet existing frameworks often ignore the WAN topology. This disconnect between WAN-bound applications and the WAN itself results in missed opportunities for cross-layer optimizations. In this paper, we present Terra to bridge this gap. Instead of decoupled WAN routing and GDA transfer scheduling, Terra applies scalable cross-layer optimizations to minimize WAN transfer times for GDA jobs. We present a two-pronged approach: (i) a scalable algorithm for joint routing and scheduling to make fast decisions; and (ii) a scalable, overlay-based enforcement mechanism that avoids expensive switch rule updates in the WAN. Together, they enable Terra to quickly react to WAN uncertainties such as large bandwidth fluctuations and failures in an application-aware manner as well. Integration with the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCloud Computing and Resource Management · Distributed and Parallel Computing Systems · Advanced Data Storage Technologies
