Digital Twin Assisted Risk-Aware Sleep Mode Management Using Deep Q-Networks
Meysam Masoudi, Ebrahim Soroush, Jens Zander, and Cicek Cavdar

TL;DR
This paper presents a risk-aware, deep reinforcement learning approach using digital twins to optimize sleep modes in base stations, significantly reducing energy consumption while maintaining user delay constraints.
Contribution
It introduces a novel digital twin-based risk estimation method integrated with deep Q-learning for energy-efficient BS sleep management.
Findings
Significant energy savings achieved with minimal user delay.
Effective risk prediction enables safer sleep mode activation.
Proposed traffic data augmentation improves model training.
Abstract
Base stations (BSs) are the most energy-consuming segment of mobile networks. To reduce BS energy consumption, different components of BSs can sleep when BS is not active. According to the activation/deactivation time of the BS components, multiple sleep modes (SMs) are defined in the literature. In this study, we model the problem of BS energy saving utilizing multiple sleep modes as a sequential MDP and propose an online traffic-aware deep reinforcement learning approach to maximize the long-term energy saving. However, there is a risk that BS is not sleeping at the right time and incurs large delays to the users. To tackle this issue, we propose to use a digital twin model to encapsulate the dynamics underlying the investigated system and estimate the risk of decision-making (RDM) in advance. We define a novel metric to quantify RDM and predict the performance degradation. The RDM…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoftware-Defined Networks and 5G · Advanced MIMO Systems Optimization · IoT and Edge/Fog Computing
MethodsTanh Activation · Sigmoid Activation · Long Short-Term Memory · Experience Replay
