Performance Modeling of Metric-Based Serverless Computing Platforms
Nima Mahmoudi, Hamzeh Khazaei

TL;DR
This paper develops analytical performance models for modern serverless platforms like Knative and Google Cloud Run, enabling developers to predict performance and costs based on workload parameters.
Contribution
It introduces new performance models tailored for concurrency-based autoscaling serverless platforms, filling a gap in existing analytical tools.
Findings
Models accurately predict steady-state performance with minimal data
Validation on Knative shows high prediction accuracy
Models assist in optimizing deployment configurations
Abstract
Analytical performance models are very effective in ensuring the quality of service and cost of service deployment remain desirable under different conditions and workloads. While various analytical performance models have been proposed for previous paradigms in cloud computing, serverless computing lacks such models that can provide developers with performance guarantees. Besides, most serverless computing platforms still require developers' input to specify the configuration for their deployment that could affect both the performance and cost of their deployment, without providing them with any direct and immediate feedback. In previous studies, we built such performance models for steady-state and transient analysis of scale-per-request serverless computing platforms (e.g., AWS Lambda, Azure Functions, Google Cloud Functions) that could give developers immediate feedback about the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCloud Computing and Resource Management · IoT and Edge/Fog Computing · Software System Performance and Reliability
Methodstravel james
