A Fault Tolerant Elastic Resource Management Framework Towards High Availability of Cloud Services
Deepika Saxena, Ishu Gupta, Ashutosh Kumar Singh, and Chung-Nan Lee

TL;DR
This paper introduces a fault-tolerant resource management framework for cloud services that predicts failures and proactively manages VMs and servers to enhance availability and reduce migration and power costs.
Contribution
It presents a novel FT-ERM framework with failure prediction and proactive VM migration, improving cloud service availability and efficiency.
Findings
Service availability increased by up to 34.47%.
VM migration reduced by up to 88.6%.
Power consumption decreased by up to 62.4%.
Abstract
Cloud computing has become inevitable for every digital service which has exponentially increased its usage. However, a tremendous surge in cloud resource demand stave off service availability resulting into outages, performance degradation, load imbalance, and excessive power-consumption. The existing approaches mainly attempt to address the problem by using multi-cloud and running multiple replicas of a virtual machine (VM) which accounts for high operational-cost. This paper proposes a Fault Tolerant Elastic Resource Management (FT-ERM) framework that addresses aforementioned problem from a different perspective by inducing high-availability in servers and VMs. Specifically, (1) an online failure predictor is developed to anticipate failure-prone VMs based on predicted resource contention; (2) the operational status of server is monitored with the help of power analyser, resource…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
