FLAS: a combination of proactive and reactive auto-scaling architecture for distributed services
V\'ictor Ramp\'erez, Javier Soriano, David Lizcano, Juan A. Lara

TL;DR
FLAS is an auto-scaling system for distributed services that intelligently combines predictive and reactive strategies to optimize resource management and maintain service levels, demonstrated on a publish-subscribe middleware.
Contribution
The paper introduces FLAS, a novel auto-scaler that integrates predictive modeling with reactive adjustments, specifically designed for distributed systems like publish-subscribe architectures.
Findings
Ensures performance compliance over 99% of the time
Effectively anticipates SLA parameter changes
Reduces instrumentation invasiveness
Abstract
Cloud computing has established itself as the support for the vast majority of emerging technologies, mainly due to the characteristic of elasticity it offers. Auto-scalers are the systems that enable this elasticity by acquiring and releasing resources on demand to ensure an agreed service level. In this article we present FLAS (Forecasted Load Auto-Scaling), an auto-scaler for distributed services that combines the advantages of proactive and reactive approaches according to the situation to decide the optimal scaling actions in every moment. The main novelties introduced by FLAS are (i) a predictive model of the high-level metrics trend which allows to anticipate changes in the relevant SLA parameters (e.g. performance metrics such as response time or throughput) and (ii) a reactive contingency system based on the estimation of high-level metrics from resource use metrics, reducing…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
