QoS-Aware Power Minimization of Distributed Many-Core Servers using Transfer Q-Learning
Dainius Jenkus, Fei Xia, Rishad Shafik, Alex Yakovlev

TL;DR
This paper introduces a transfer Q-learning based runtime control method for distributed many-core servers that dynamically adjusts resource allocation to minimize power consumption while maintaining QoS guarantees.
Contribution
It presents a novel transfer Q-learning approach combined with horizontal and vertical scaling for energy-efficient QoS management in distributed servers.
Findings
Reduces power consumption compared to model-free Q-learning.
Minimizes QoS violations under dynamic workloads.
Achieves scalable and portable runtime control for heterogeneous server clusters.
Abstract
Web servers scaled across distributed systems necessitate complex runtime controls for providing quality of service (QoS) guarantees as well as minimizing the energy costs under dynamic workloads. This paper presents a QoS-aware runtime controller using horizontal scaling (node allocation) and vertical scaling (resource allocation within nodes) methods synergistically to provide adaptation to workloads while minimizing the power consumption under QoS constraint (i.e., response time). A horizontal scaling determines the number of active nodes based on workload demands and the required QoS according to a set of rules. Then, it is coupled with vertical scaling using transfer Q-learning, which further tunes power/performance based on workload profile using dynamic voltage/frequency scaling (DVFS). It transfers Q-values within minimally explored states reducing exploration requirements. In…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCloud Computing and Resource Management · Software-Defined Networks and 5G · IoT and Edge/Fog Computing
Methodstravel james
