A Meta Reinforcement Learning Approach for Predictive Autoscaling in the   Cloud

Siqiao Xue; Chao Qu; Xiaoming Shi; Cong Liao; Shiyi Zhu; Xiaoyu Tan,; Lintao Ma; Shiyu Wang; Shijun Wang; Yun Hu; Lei Lei; Yangfei Zheng; Jianguo; Li; James Zhang

arXiv:2205.15795·cs.LG·June 1, 2022

A Meta Reinforcement Learning Approach for Predictive Autoscaling in the Cloud

Siqiao Xue, Chao Qu, Xiaoming Shi, Cong Liao, Shiyi Zhu, Xiaoyu Tan,, Lintao Ma, Shiyu Wang, Shijun Wang, Yun Hu, Lei Lei, Yangfei Zheng, Jianguo, Li, James Zhang

PDF

1 Repo

TL;DR

This paper introduces a meta reinforcement learning approach with workload prediction and neural processes to improve cloud autoscaling, achieving high accuracy and adaptability in dynamic environments, and is deployed at Alipay.

Contribution

It presents a novel end-to-end meta RL algorithm with workload prediction and neural processes for more accurate and adaptive cloud autoscaling.

Findings

01

Significant performance improvements over existing algorithms.

02

Successful deployment at Alipay for real-world cloud autoscaling.

03

Enhanced adaptability to workload fluctuations.

Abstract

Predictive autoscaling (autoscaling with workload forecasting) is an important mechanism that supports autonomous adjustment of computing resources in accordance with fluctuating workload demands in the Cloud. In recent works, Reinforcement Learning (RL) has been introduced as a promising approach to learn the resource management policies to guide the scaling actions under the dynamic and uncertain cloud environment. However, RL methods face the following challenges in steering predictive autoscaling, such as lack of accuracy in decision-making, inefficient sampling and significant variability in workload patterns that may cause policies to fail at test time. To this end, we propose an end-to-end predictive meta model-based RL algorithm, aiming to optimally allocate resource to maintain a stable CPU utilization level, which incorporates a specially-designed deep periodic workload…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

ilevyfan/meta_rl_scaling
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.