Learning-based Two-tiered Online Optimization of Region-wide Datacenter Resource Allocation
Chang-Lin Chen, Hanhan Zhou, Jiayu Chen, Mohammad Pedramfar, Tian Lan,, Zheqing Zhu, Chi Zhou, Pol Mauri Ruiz, Neeraj Kumar, Hongbo Dong, and Vaneet, Aggarwal

TL;DR
This paper introduces a two-tiered online optimization framework combining reinforcement learning and MILP to improve resource allocation in large-scale data centers, achieving better performance and speed.
Contribution
It presents a novel two-tiered approach integrating RL and MILP for dynamic resource management in data centers, with interpretability and efficiency improvements.
Findings
Outperforms pure MIP solver by over 15%.
Achieves 100x speedup in computation.
Effectively handles large-scale RAS problems.
Abstract
Online optimization of resource management for large-scale data centers and infrastructures to meet dynamic capacity reservation demands and various practical constraints (e.g., feasibility and robustness) is a very challenging problem. Mixed Integer Programming (MIP) approaches suffer from recognized limitations in such a dynamic environment, while learning-based approaches may face with prohibitively large state/action spaces. To this end, this paper presents a novel two-tiered online optimization to enable a learning-based Resource Allowance System (RAS). To solve optimal server-to-reservation assignment in RAS in an online fashion, the proposed solution leverages a reinforcement learning (RL) agent to make high-level decisions, e.g., how much resource to select from the Main Switch Boards (MSBs), and then a low-level Mixed Integer Linear Programming (MILP) solver to generate the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCloud Computing and Resource Management · Software-Defined Networks and 5G · IoT and Edge/Fog Computing
