Towards Serverless Optimization with In-place Scaling

Vincent Hsieh; Jerry Chou

arXiv:2311.09526·cs.DC·November 17, 2023·1 cites

Towards Serverless Optimization with In-place Scaling

Vincent Hsieh, Jerry Chou

PDF

Open Access

TL;DR

This paper investigates how in-place scaling in Kubernetes v1.27 can reduce cold start latency in serverless computing, demonstrating significant improvements in request response times across different workloads.

Contribution

It provides an empirical analysis of in-place scaling's impact on serverless latency, highlighting its potential to mitigate cold start delays.

Findings

01

Latency reductions of 1.16 to 18.15 times across workloads

02

In-place scaling improves serverless responsiveness

03

Experimental results validate the effectiveness of in-place scaling

Abstract

Serverless computing has gained popularity due to its cost efficiency, ease of deployment, and enhanced scalability. However, in serverless environments, servers are initiated only after receiving a request, leading to increased response times. This delay is commonly known as the cold start problem. In this study, we explore the in-place scaling feature released in Kubernetes v1.27 and examine its impact on serverless computing. Our experimental results reveal improvements in request latency, with reductions ranging from 1.16 to 18.15 times across various workloads when compared to traditional cold policy.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsCloud Computing and Resource Management · IoT and Edge/Fog Computing · Software-Defined Networks and 5G