VM Scaling and Load Balancing via Cost Optimal MDP Solution
Mark Shifrin, Roy Mitrany, Erez Biton, Omer Gurewitz

TL;DR
This paper presents a cost-optimal load balancing strategy for cloud VMs using MDP, balancing deployment costs, SLA constraints, and performance metrics, validated on AWS.
Contribution
It introduces a novel MDP-based approach for optimizing VM deployment and load balancing considering costs, SLAs, and performance, with threshold-based policies.
Findings
Optimal policies have decision thresholds based on system parameters.
MDP solutions effectively balance costs and performance constraints.
Validation on AWS demonstrates practical applicability.
Abstract
We address a cost optimization problem faced by a user who runs instances of applications in a remote cloud configuration constructed of multiple virtual machines (VMs). Each VM runs a single application instance which can execute tasks specific to that application. Managing the VMs involves a sophisticated trade-off between cloud-related demands, which are expressed by the provisional costs of leased cloud resources, and exogenous cost demands expressed by service revenues that are typically bound to SLAs. The internal costs may include VM deployment/termination cost, and VM lease cost. The exogenous costs refer to rewards accumulated due to the successfully accomplished tasks being run by each application instance. In the case where the SLA restricts performance to a certain load level at each VM, tasks incoming at VMs that reached that level are rejected. Rejections cause fines…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
