Adaptive Two-Stage Cloud Resource Scaling via Hierarchical Multi-Indicator Forecasting and Bayesian Decision-Making
Yang Luo, Shiyu Wang, Zhemeng Yu, Wei Lu, Xiaofeng Gao, Lintao Ma,, Guihai Chen

TL;DR
HARMONY is an adaptive cloud resource management system that uses hierarchical forecasting and Bayesian decision-making to improve resource utilization and reduce costs in large-scale data centers.
Contribution
The paper introduces HARMONY, a novel system combining hierarchical indicator modeling with uncertainty-aware Bayesian optimization for adaptive cloud resource scaling.
Findings
Outperforms nine existing methods in large-scale evaluations.
Achieves over 35,000 GPU hours savings in real-world deployment.
Reduces costs by more than $100,000 through adaptive resource management.
Abstract
The surging demand for cloud computing resources, driven by the rapid growth of sophisticated large-scale models and data centers, underscores the critical importance of efficient and adaptive resource allocation. As major tech enterprises deploy massive infrastructures with thousands of GPUs, existing cloud platforms still struggle with low resource utilization due to key challenges: capturing hierarchical indicator structures, modeling non-Gaussian distributions, and decision-making under uncertainty. To address these challenges, we propose HRAMONY, an adaptive Hierarchical Attention-based Resource Modeling and Decision-Making System. HARMONY combines hierarchical multi-indicator distribution forecasting and uncertainty-aware Bayesian decision-making. It introduces a novel hierarchical attention mechanism that comprehensively models complex inter-indicator dependencies, enabling…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCloud Computing and Resource Management
MethodsSoftmax · Attention Is All You Need · Normalizing Flows
