LSRAM: A Lightweight Autoscaling and SLO Resource Allocation Framework for Microservices Based on Gradient Descent
Kan Hu, Minxian Xu, Kejiang Ye, Chengzhong Xu

TL;DR
LSRAM is a lightweight, gradient descent-based framework for microservice SLO resource allocation that quickly adapts to environment changes, reduces resource usage, and maintains QoS.
Contribution
It introduces a novel, lightweight SLO resource allocation model that is faster, scalable, and easier to retrain than existing complex models.
Findings
Reduces resource usage by 17% compared to state-of-the-art methods.
Effectively handles bursty traffic and fluctuating loads.
Quickly adapts to environment changes with two-stage update model.
Abstract
Microservices architecture has become the dominant architecture in cloud computing paradigm with its advantages of facilitating development, deployment, modularity and scalability. The workflow of microservices architecture is transparent to the users, who are concerned with the quality of service (QoS). Taking Service Level Objective (SLO) as an important indicator of system resource scaling can effectively ensure user's QoS, but how to quickly allocate end-to-end SLOs to each microservice in a complete service so that it can obtain the optimal SLO resource allocation scheme is still a challenging problem. Existing microservice autoscaling frameworks based on SLO resources often have heavy and complex models that demand substantial time and computational resources to get a suitable resource allocation scheme. Moreover, when the system environment or microservice application changes,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCloud Computing and Resource Management · Software System Performance and Reliability · IoT and Edge/Fog Computing
