RobustScaler: QoS-Aware Autoscaling for Complex Workloads
Huajie Qian, Qingsong Wen, Liang Sun, Jing Gu, Qiulin Niu, Zhimin Tang

TL;DR
RobustScaler is a proactive autoscaling framework for cloud applications with complex, uncertain workloads, using NHPP modeling and optimization to balance resource costs and QoS guarantees.
Contribution
It introduces a novel NHPP-based autoscaling framework with an efficient ADMM training method and proven QoS guarantees for complex workload patterns.
Findings
Outperforms baseline autoscaling strategies in real-world traces.
Achieves better cost-QoS trade-offs for complex workloads.
Demonstrates robustness against workload uncertainties.
Abstract
Autoscaling is a critical component for efficient resource utilization with satisfactory quality of service (QoS) in cloud computing. This paper investigates proactive autoscaling for widely-used scaling-per-query applications where scaling is required for each query, such as container registry and function-as-a-service (FaaS). In these scenarios, the workload often exhibits high uncertainty with complex temporal patterns like periodicity, noises and outliers. Conservative strategies that scale out unnecessarily many instances lead to high resource costs whereas aggressive strategies may result in poor QoS. We present RobustScaler to achieve superior trade-off between cost and QoS. Specifically, we design a novel autoscaling framework based on non-homogeneous Poisson processes (NHPP) modeling and stochastically constrained optimization. Furthermore, we develop a specialized alternating…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCloud Computing and Resource Management · Stochastic Gradient Optimization Techniques · Age of Information Optimization
