Performance considerations on execution of large scale workflow applications on cloud functions
Maciej Pawlik, Kamil Figiela, Maciej Malawski

TL;DR
This paper evaluates the performance of major cloud Function-as-a-Service providers for scientific workflows, highlighting performance bottlenecks, overheads, and infrastructure provisioning challenges in large-scale parallel task execution.
Contribution
It provides a comprehensive benchmarking framework and analysis of FaaS platforms for scientific workflows, revealing key performance insights and limitations.
Findings
API overhead significantly impacts workflow run time
Performance varies with function size and parallelism
Provisioning is limited by rate and parallelism constraints
Abstract
Function-as-a-Service is a novel type of cloud service used for creating distributed applications and utilizing computing resources. Application developer supplies source code of cloud functions, which are small applications or application components, while the service provider is responsible for provisioning the infrastructure, scaling and exposing a REST style API. This environment seems to be adequate for running scientific workflows, which in recent years, have become an established paradigm for implementing and preserving complex scientific processes. In this paper, we present work done on evaluating three major FaaS providers (Amazon, Google, IBM) as a platform for running scientific workflows. The experiments were performed with a dedicated benchmarking framework, which consisted of instrumented workflow execution engine. The testing load was implemented as a large scale…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsScientific Computing and Data Management · Distributed and Parallel Computing Systems · Cloud Computing and Resource Management
