A Predictive Autoscaler for Elastic Batch Jobs
Peng Gao

TL;DR
This paper introduces a predictive autoscaler for elastic batch jobs that leverages time series analysis and regression models to optimize resource provisioning, reduce costs, and improve scheduling efficiency in cloud environments.
Contribution
It proposes a novel method to embed heterogeneous resource requirements into discrete buckets and a predictive autoscaler that outperforms existing methods in resource planning.
Findings
Relieves scaling planning burden
Reduces launch times and costs
Outperforms other prediction methods
Abstract
Large batch jobs such as Deep Learning, HPC and Spark require far more computational resources and higher cost than conventional online service. Like the processing of other time series data, these jobs possess a variety of characteristics such as trend, burst, and seasonality. Cloud providers offer short-term instances to achieve scalability, stability, and cost-efficiency. Given the time lag caused by joining into the cluster and initialization, crowded workloads may lead to a violation in the scheduling system. Based on the assumption that there are infinite resources and ideal placements available for users to require in the cloud environment, we propose a predictive autoscaler to provide an elastic interface for the customers and overprovision instances based on the trained regression model. We contribute to a method to embed heterogeneous resource requirements in continuous space…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTime Series Analysis and Forecasting · Industrial Vision Systems and Defect Detection · Neural Networks and Applications
