Scaling Power Management in Cloud Data Centers: A Multi-Level Continuous-Time MDP Approach
Behzad Chitsaz, Ahmad Khonsari, Masoumeh Moradian, Aresh Dadlani,, Mohammad Sadegh Talebi

TL;DR
This paper introduces a multi-level continuous-time Markov decision process model for power management in large-scale cloud data centers, effectively reducing complexity while maintaining accuracy and improving performance over existing threshold-based methods.
Contribution
It proposes a novel multi-level CTMDP approach with state aggregation that overcomes scalability issues and achieves near-optimal power management in data centers.
Findings
50% reduction in CTMDP size
Better rewards than threshold-based policies
Effective in various scenarios
Abstract
Power management in multi-server data centers~especially at scale is a vital issue of increasing importance in cloud computing paradigm. Existing studies mostly consider thresholds on the number of idle servers to switch the servers on or off and suffer from scalability issues. As a natural approach in view~of~the Markovian assumption, we present a multi-level continuous-time Markov decision process (CTMDP) model based on state aggregation of multi-server data centers with setup times that interestingly overcomes the inherent intractability of traditional MDP approaches due to their colossal state-action space. The beauty of the presented model is that, while it keeps loyalty to the Markovian behavior, it approximates the calculation of the transition probabilities in a way that keeps the accuracy of the results at a desirable level. Moreover, near-optimal performance is attained at the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCloud Computing and Resource Management · IoT and Edge/Fog Computing · Age of Information Optimization
