Taming Cold Starts: Proactive Serverless Scheduling with Model Predictive Control
Chanh Nguyen, Monowar Bhuyan, Erik Elmroth

TL;DR
This paper introduces a proactive serverless scheduling approach using Model Predictive Control to forecast invocations, optimize container prewarming, and reduce cold start latency and resource consumption.
Contribution
It presents a novel predictive scheduling framework that proactively mitigates cold starts in serverless platforms, improving latency and efficiency.
Findings
Achieves up to 85% lower tail latency.
Reduces resource usage by 34%.
Outperforms existing baselines in experiments.
Abstract
Serverless computing has transformed cloud application deployment by introducing a fine-grained, event-driven execution model that abstracts away infrastructure management. Its on-demand nature makes it especially appealing for latency-sensitive and bursty workloads. However, the cold start problem, i.e., where the platform incurs significant delay when provisioning new containers, remains the Achilles' heel of such platforms. This paper presents a predictive serverless scheduling framework based on Model Predictive Control to proactively mitigate cold starts, thereby improving end-to-end response time. By forecasting future invocations, the controller jointly optimizes container prewarming and request dispatching, improving latency while minimizing resource overhead. We implement our approach on Apache OpenWhisk, deployed on a Kubernetes-based testbed. Experimental results using…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
