Practical Efficient Microservice Autoscaling with QoS Assurance
Md Rajib Hossen, Mohammad A. Islam, and Kishwar Ahmed

TL;DR
This paper introduces PEMA, a lightweight and adaptive autoscaling method for microservices that reduces resource usage by up to 33% while maintaining QoS, addressing the limitations of existing ML-based approaches.
Contribution
PEMA is a novel, workload-aware autoscaling framework that efficiently manages microservice resources through opportunistic reduction, improving over existing methods.
Findings
PEMA reduces resource usage by up to 33%.
PEMA maintains QoS in microservice deployments.
PEMA adapts efficiently to changing workloads.
Abstract
Cloud applications are increasingly moving away from monolithic services to agile microservices-based deployments. However, efficient resource management for microservices poses a significant hurdle due to the sheer number of loosely coupled and interacting components. The interdependencies between various microservices make existing cloud resource autoscaling techniques ineffective. Meanwhile, machine learning (ML) based approaches that try to capture the complex relationships in microservices require extensive training data and cause intentional SLO violations. Moreover, these ML-heavy approaches are slow in adapting to dynamically changing microservice operating environments. In this paper, we propose PEMA (Practical Efficient Microservice Autoscaling), a lightweight microservice resource manager that finds efficient resource allocation through opportunistic resource reduction.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCloud Computing and Resource Management · Software System Performance and Reliability · IoT and Edge/Fog Computing
