Combining Serverless and High-Performance Computing Paradigms to support ML Data-Intensive Applications
Mills Staylor, Arup Kumar Sarker, Gregor von Laszewski, Geoffrey Fox, Yue Cheng, Judy Fox

TL;DR
This paper presents Cylon, a high-performance distributed data frame system that combines serverless and HPC paradigms, enabling efficient large-scale data processing in cloud environments with near-HPC performance.
Contribution
The paper introduces a novel serverless communicator for distributed data processing, achieving HPC-like efficiency on AWS Lambda for data-intensive applications.
Findings
Scaling efficiency within 6.5% of serverful AWS at 64 nodes
Demonstrates effective direct communication via NAT Traversal TCP Hole Punching
Shows promising results for Python-based data processing in serverless environments
Abstract
Data is found everywhere, from health and human infrastructure to the surge of sensors and the proliferation of internet-connected devices. To meet this challenge, the data engineering field has expanded significantly in recent years in both research and industry. Traditionally, data engineering, Machine Learning, and AI workloads have been run on large clusters within data center environments, requiring substantial investment in hardware and maintenance. With the rise of the public cloud, it is now possible to run large applications across nodes without owning or maintaining hardware. Serverless functions such as AWS Lambda provide horizontal scaling and precise billing without the hassle of managing traditional cloud infrastructure. However, when processing large datasets, users often rely on external storage options that are significantly slower than direct communication typical of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCloud Computing and Resource Management · IoT and Edge/Fog Computing · Big Data and Digital Economy
