ADAPT: A Self-Calibrating Proactive Autoscaler for Container Orchestration

Himanshu Singh Baghel

arXiv:2605.15788·cs.DC·May 18, 2026

ADAPT: A Self-Calibrating Proactive Autoscaler for Container Orchestration

Himanshu Singh Baghel

PDF

TL;DR

ADAPT introduces an online estimator and a dynamic planning approach for proactive autoscaling in container orchestration, significantly reducing SLA violations across diverse workloads.

Contribution

It presents a novel self-calibrating autoscaler that adapts its lookahead based on measured provisioning delays using an EWMA estimator and MPC, improving scaling accuracy.

Findings

01

MPC+LSTM achieves below 5% SLA violation on all workloads.

02

Compared to reactive HPA, MPC+LSTM reduces violations from 7-19%.

03

MPC+Prophet can have violations up to 28.7% on bimodal traffic.

Abstract

Proactive autoscaling for containerized workloads depends on knowing the provisioning delay, i.e., the time between a scaling decision and the moment new capacity is ready to serve traffic. In practice, this cold-start duration can vary substantially across environments and even across consecutive scale-out events. We present ADAPT (Adaptive Duration Approximation for Predictive Timing), an online EWMA estimator that tracks coldstart duration at runtime. ADAPT feeds a dynamic planning horizon, FH-OPT, into a Model Predictive Controller (MPC) that optimizes replica counts over a rolling window. Together, these components form a closed-loop proactive autoscaling design that adapts its lookahead based on measured provisioning delay. Evaluated across three policies (MPC+LSTM, MPC+Prophet, HPA) and six workload archetypes with five random seeds, MPC+LSTM achieves below 5% SLA violation on…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.