Global Optimization of Data Pipelines in Heterogeneous Cloud Environments
Erica Lin, Luna Xu, Suraj Bramhavar, Marco Montes de Oca, Sean Gorsky,, Lingyun Yi, Arianna Groetsema, Jeffrey Chou

TL;DR
This paper introduces AGORA, a scheduler that optimizes resource allocation and task scheduling for data pipelines in heterogeneous cloud environments, significantly improving performance and reducing costs.
Contribution
AGORA jointly optimizes resource configuration and scheduling for DAG workflows in heterogeneous clouds, addressing the NP-hard problem more effectively than existing solutions.
Findings
Up to 45% performance improvement
Up to 77% cost reduction
65% cost savings on real-world Alibaba trace
Abstract
Modern production data processing and machine learning pipelines on the cloud are critical components for many cloud-based companies. These pipelines are typically composed of complex workflows represented by directed acyclic graphs (DAGs). Cloud environments are attractive to these workflows due to the wide range of choice with heterogeneous instances and prices that can provide the flexibility for different cost-performance needs. However, this flexibility also leads to the complexity of selecting the right resource configuration (e.g., instance type, resource demands) for each task in the DAG, while simultaneously scheduling the tasks with the selected resources to reach the optimal end-to-end performance and cost. These two decisions are often codependent resulting in an NP-hard scheduling optimization bottleneck. Existing solutions only focus solely on either problem and ignore the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCloud Computing and Resource Management · Graph Theory and Algorithms · IoT and Edge/Fog Computing
