Self-Aware and Self-Adaptive Autoscaling for Cloud Based Services
Tao Chen

TL;DR
This paper introduces a comprehensive self-aware and self-adaptive autoscaling framework for cloud services, improving automatic resource management by addressing uncertainties, interference, and trade-offs without heavy manual intervention.
Contribution
It presents a holistic autoscaling framework leveraging self-awareness principles to handle cloud dynamics, uncertainties, and QoS trade-offs effectively and seamlessly.
Findings
Framework improves autoscaling efficiency and responsiveness.
Experimental results show better QoS and resource utilization.
Outperforms existing state-of-the-art approaches.
Abstract
Modern Internet services are increasingly leveraging on cloud computing for flexible, elastic and on-demand provision. Typically, Quality of Service (QoS) of cloud-based services can be tuned using different underlying cloud configurations and resources, e.g., number of threads, CPU and memory etc., which are shared, leased and priced as utilities. This benefit is fundamentally grounded by autoscaling: an automatic and elastic process that adapts cloud configurations on-demand according to time-varying workloads. This thesis proposes a holistic cloud autoscaling framework to effectively and seamlessly address existing challenges related to different logical aspects of autoscaling, including architecting autoscaling system, modelling the QoS of cloud-based service, determining the granularity of control and deciding trade-off autoscaling decisions. The framework takes advantages of the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCloud Computing and Resource Management · Distributed and Parallel Computing Systems · Software System Performance and Reliability
