Energy Efficient Sleep Mode Optimization in 5G mmWave Networks via Multi Agent Deep Reinforcement Learning
Saad Masrur, Ismail Guvenc, David Lopez Perez

TL;DR
This paper introduces a multi-agent deep reinforcement learning framework for optimizing sleep modes in 5G mmWave networks, significantly improving energy efficiency and QoS in dynamic urban environments.
Contribution
It proposes a scalable MARL-DDQN approach for adaptive sleep mode optimization that captures non-stationary traffic and reduces signaling overhead.
Findings
Achieves up to 0.60 Mbit/Joule energy efficiency
Ensures 95% QoS constraint satisfaction
Outperforms existing strategies in dynamic scenarios
Abstract
Dynamic sleep mode optimization (SMO) in millimeter-wave (mmWave) networks is essential for maximizing energy efficiency (EE) under stringent quality-of-service (QoS) constraints. However, existing optimization and reinforcement learning (RL) approaches rely on aggregated, static base station (BS) traffic models that fail to capture non-stationary traffic dynamics and suffer from large state-action spaces, limiting real-world deployment. To address these challenges, this paper proposes a multi-agent deep reinforcement learning (MARL) framework using a Double Deep Q-Network (DDQN), referred to as MARL-DDQN, for adaptive SMO in a 3D urban environment with a time-varying and community-based user equipment (UE) mobility model. Unlike conventional single-agent RL, MARL-DDQN enables scalable, distributed decision-making with minimal signaling overhead. A realistic BS power consumption model…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMillimeter-Wave Propagation and Modeling · Advanced MIMO Systems Optimization · Green IT and Sustainability
