Serverless Data Science -- Are We There Yet? A Case Study of Model Serving
Yuncheng Wu, Tien Tuan Anh Dinh, Guoyu Hu, Meihui Zhang, Yeow Meng, Chee, Beng Chin Ooi

TL;DR
This paper evaluates the performance and cost-effectiveness of serverless computing for model serving in data science, demonstrating its potential advantages over traditional cloud-based systems and GPU-based solutions.
Contribution
It provides a comprehensive performance and cost comparison of serverless model serving with other systems and explores the design space and challenges of serverless model serving.
Findings
Serverless outperforms many cloud-based alternatives.
In some settings, serverless surpasses GPU-based systems.
Provides design guidelines for serverless model serving.
Abstract
Machine learning (ML) is an important part of modern data science applications. Data scientists today have to manage the end-to-end ML life cycle that includes both model training and model serving, the latter of which is essential, as it makes their works available to end-users. Systems of model serving require high performance, low cost, and ease of management. Cloud providers are already offering model serving choices, including managed services and self-rented servers. Recently, serverless computing, whose advantages include high elasticity and a fine-grained cost model, brings another option for model serving. Our goal in this paper is to examine the viability of serverless as a mainstream model serving platform. To this end, we first conduct a comprehensive evaluation of the performance and cost of serverless against other model serving systems on Amazon Web Service and Google…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
Methodstravel james
