Automatic Cloud Resource Scaling Algorithm based on Long Short-Term   Memory Recurrent Neural Network

Ashraf A. Shahin

arXiv:1701.03295·cs.DC·January 13, 2017

Automatic Cloud Resource Scaling Algorithm based on Long Short-Term Memory Recurrent Neural Network

Ashraf A. Shahin

PDF

TL;DR

This paper introduces a dynamic auto-scaling algorithm for cloud resources using LSTM neural networks to predict demand, improving cost efficiency and SLA compliance over traditional threshold-based methods.

Contribution

It presents a novel LSTM-based prediction approach for cloud auto-scaling, addressing workload variability and outperforming existing algorithms.

Findings

01

Proposed algorithms outperform existing auto-scaling methods.

02

LSTM-based predictions improve resource provisioning accuracy.

03

Cost savings and SLA adherence are enhanced with the new approach.

Abstract

Scalability is an important characteristic of cloud computing. With scalability, cost is minimized by provisioning and releasing resources according to demand. Most of current Infrastructure as a Service (IaaS) providers deliver threshold-based auto-scaling techniques. However, setting up thresholds with right values that minimize cost and achieve Service Level Agreement is not an easy task, especially with variant and sudden workload changes. This paper has proposed dynamic threshold based auto-scaling algorithms that predict required resources using Long Short-Term Memory Recurrent Neural Network and auto-scale virtual resources based on predicted values. The proposed algorithms have been evaluated and compared with some of existing algorithms. Experimental results show that the proposed algorithms outperform other algorithms.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.