Cloud Instance Management and Resource Prediction For Computation-as-a-Service Platforms
Joseph Doyle, Vasileios Giotsas, Mohammad Ashraful Anam, Yiannis, Andreopoulos

TL;DR
This paper presents integrated approaches for resource prediction, task assignment, and instance control in CaaS platforms, significantly reducing costs compared to existing solutions like Amazon Lambda and Autoscale.
Contribution
It introduces a novel combination of proportional fairness, Kalman filtering, and AIMD algorithms for efficient cloud resource management in CaaS platforms.
Findings
27% cost reduction over reactive resource prediction methods
38% to 60% cost reduction compared to Amazon Lambda and Autoscale
Effective integration of three approaches improves resource efficiency
Abstract
Computation-as-a-Service (CaaS) offerings have gained traction in the last few years due to their effectiveness in balancing between the scalability of Software-as-a-Service and the customisation possibilities of Infrastructure-as-a-Service platforms. To function effectively, a CaaS platform must have three key properties: (i) reactive assignment of individual processing tasks to available cloud instances (compute units) according to availability and predetermined time-to-completion (TTC) constraints; (ii) accurate resource prediction; (iii) efficient control of the number of cloud instances servicing workloads, in order to optimize between completing workloads in a timely fashion and reducing resource utilization costs. In this paper, we propose three approaches that satisfy these properties (respectively): (i) a service rate allocation mechanism based on proportional fairness and TTC…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
