Reinforcement Learning with Reward Machines for Sleep Control in Mobile Networks
Kristina Levina, Nikolaos Pappas, Athanasios Karapantelakis, Aneta Vulgarakis Feljan, and Jendrik Seipp

TL;DR
This paper introduces a reinforcement learning framework using reward machines to optimize sleep control in mobile networks, balancing energy savings with QoS constraints like packet drop rates and throughput.
Contribution
It presents a novel RL approach with reward machines that handle history-dependent constraints for energy-efficient mobile network management.
Findings
Effective balancing of energy savings and QoS constraints demonstrated.
Reward machines successfully manage non-Markovian, history-dependent decision problems.
Scalable framework applicable to diverse traffic and QoS scenarios.
Abstract
Energy efficiency in mobile networks is crucial for sustainable telecommunications infrastructure, particularly as network densification continues to increase power consumption. Sleep mechanisms for the components in mobile networks can reduce energy use, but deciding which components to put to sleep, when, and for how long while preserving quality of service (QoS) remains a difficult optimisation problem. In this paper, we utilise reinforcement learning with reward machines (RMs) to make sleep-control decisions that balance immediate energy savings and long-term QoS impact, i.e. time-averaged packet drop rates for deadline-constrained traffic and time-averaged minimum-throughput guarantees for constant-rate users. A challenge is that time-averaged constraints depend on cumulative performance over time rather than immediate performance. As a result, the effective reward is…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
