DataFlower: Exploiting the Data-flow Paradigm for Serverless Workflow Orchestration
Zijun Li, Chuhao Xu, Quan Chen, Jieru Zhao, Chen Chen, Minyi Guo

TL;DR
DataFlower introduces a data-flow-based approach to serverless workflow orchestration that reduces latency and increases throughput by optimizing data transmission and processing data asynchronously.
Contribution
It presents a novel data-flow paradigm for serverless workflows, with a container abstraction and communication mechanism to enhance performance over traditional control-flow methods.
Findings
Reduces 99th percentile latency by up to 35.4%.
Improves peak throughput by up to 3.8 times.
Demonstrates effectiveness through experimental evaluation.
Abstract
Serverless computing that runs functions with auto-scaling is a popular task execution pattern in the cloud-native era. By connecting serverless functions into workflows, tenants can achieve complex functionality. Prior research adopts the control-flow paradigm to orchestrate a serverless workflow. However, the control-flow paradigm inherently results in long response latency, due to the heavy data persistence overhead, sequential resource usage, and late function triggering. Our investigation shows that the data-flow paradigm has the potential to resolve the above problems, with careful design and optimization. We propose DataFlower, a scheme that achieves the data-flow paradigm for serverless workflows. In DataFlower, a container is abstracted to be a function logic unit and a data logic unit. The function logic unit runs the functions, and the data logic unit handles the data…
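The split between a function logic unit and a data logic unit can be illustrated with a toy Python sketch. This is not DataFlower's implementation: the `Container` class, its queues, and `run_workflow` are hypothetical names, threads stand in for real containers, and the assumption that the data logic unit forwards outputs to downstream containers is ours, since the abstract is truncated at that point. What the sketch does show is the data-flow idea that a function fires once all of its inputs have arrived, with no central orchestrator waiting on predecessor completion.

```python
import queue
import threading


class Container:
    """Hypothetical container with a function logic unit and a data logic unit."""

    def __init__(self, name, func, n_inputs):
        self.name = name
        self.func = func              # user function; takes a dict {source: value}
        self.n_inputs = n_inputs      # number of distinct upstream inputs to wait for
        self.inbox = queue.Queue()    # data arriving from upstream
        self.outbox = queue.Queue()   # hand-off from function unit to data unit
        self.downstream = []          # inbox queues of downstream consumers

    def connect(self, inbox):
        self.downstream.append(inbox)

    def _function_logic_unit(self):
        # Fire as soon as every expected input is present (data-driven trigger).
        inputs = {}
        while len(inputs) < self.n_inputs:
            source, value = self.inbox.get()
            inputs[source] = value
        self.outbox.put(self.func(inputs))

    def _data_logic_unit(self):
        # Assumption: the data unit pushes results downstream on its own thread,
        # so data transfer does not block the function unit.
        result = self.outbox.get()
        for inbox in self.downstream:
            inbox.put((self.name, result))

    def start(self):
        threads = [
            threading.Thread(target=self._function_logic_unit),
            threading.Thread(target=self._data_logic_unit),
        ]
        for t in threads:
            t.start()
        return threads


def run_workflow(text):
    # Fan-out to two functions, then fan-in to a join function.
    upper = Container("upper", lambda d: d["src"].upper(), 1)
    count = Container("count", lambda d: len(d["src"].split()), 1)
    join = Container("join", lambda d: f'{d["upper"]} ({d["count"]} words)', 2)

    sink = queue.Queue()
    upper.connect(join.inbox)
    count.connect(join.inbox)
    join.connect(sink)

    threads = []
    for container in (upper, count, join):
        threads += container.start()

    # Injecting the input data is what triggers execution.
    upper.inbox.put(("src", text))
    count.inbox.put(("src", text))

    _, result = sink.get()
    for t in threads:
        t.join()
    return result


if __name__ == "__main__":
    print(run_workflow("hello data flow"))
```

In this sketch the `join` container starts without being told that `upper` and `count` have finished; it simply proceeds once both values are in its inbox. A control-flow orchestrator would instead persist the intermediate results and trigger `join` only after observing that both predecessors completed.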
Taxonomy
Topics: Cloud Computing and Resource Management · Scientific Computing and Data Management · Distributed and Parallel Computing Systems
