LSRAM: A Lightweight Autoscaling and SLO Resource Allocation Framework for Microservices Based on Gradient Descent

Kan Hu; Minxian Xu; Kejiang Ye; Chengzhong Xu

arXiv:2411.11493 · cs.DC · November 19, 2024

Open Access

TL;DR

LSRAM is a lightweight, gradient descent-based framework for microservice SLO resource allocation that quickly adapts to environment changes, reduces resource usage, and maintains QoS.

Contribution

The paper introduces a novel, lightweight SLO resource allocation model that is faster, more scalable, and easier to retrain than existing complex models.

Findings

01 Reduces resource usage by 17% compared to state-of-the-art methods.

02 Effectively handles bursty traffic and fluctuating loads.

03 Quickly adapts to environment changes with a two-stage update model.

Abstract

Microservices architecture has become the dominant architecture in the cloud computing paradigm because it facilitates development, deployment, modularity, and scalability. The workflow of a microservices architecture is transparent to users, who are concerned with the quality of service (QoS). Using the Service Level Objective (SLO) as a key indicator for system resource scaling can effectively ensure users' QoS, but quickly allocating the end-to-end SLO across the individual microservices of a complete service, so as to obtain the optimal SLO resource allocation scheme, remains a challenging problem. Existing SLO-based microservice autoscaling frameworks often rely on heavy, complex models that demand substantial time and computational resources to find a suitable resource allocation scheme. Moreover, when the system environment or microservice application changes,…

Taxonomy

Topics: Cloud Computing and Resource Management · Software System Performance and Reliability · IoT and Edge/Fog Computing