Scheduling Methods to Reduce Response Latency of Function as a Service
Pawel Zuk, Krzysztof Rzadca

TL;DR
This paper proposes scheduling heuristics for Function as a Service (FaaS) that leverage knowledge of invocation sequences to reduce response latency, especially when setup times are significant.
Contribution
It introduces a scheduling model for FaaS that incorporates invocation sequencing, environment deployment, and allocation, with heuristics demonstrating improved latency reduction.
Findings
Heuristics outperform greedy algorithms when setup times are long.
Knowledge of invocation sequences reduces response latency.
Simulation results show significant latency improvements with informed scheduling.
Abstract
Function as a Service (FaaS) permits cloud customers to deploy to cloud individual functions, in contrast to complete virtual machines or Linux containers. All major cloud providers offer FaaS products (Amazon Lambda, Google Cloud Functions, Azure Serverless); there are also popular open-source implementations (Apache OpenWhisk) with commercial offerings (Adobe I/O Runtime, IBM Cloud Functions). A new feature of FaaS is function composition: a function may (sequentially) call another function, which, in turn, may call yet another function - forming a chain of invocations. From the perspective of the infrastructure, a composed FaaS is less opaque than a virtual machine or a container. We show that this additional information enables the infrastructure to reduce the response latency. In particular, knowing the sequence of future invocations, the infrastructure can schedule these…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
