Deep Reinforcement Learning Approach to QoSAware Load Balancing in 5G Cellular Networks under User Mobility and Observation Uncertainty
Mehrshad Eskandarpour, Hossein Soleimani

TL;DR
This paper introduces a deep reinforcement learning framework using PPO for autonomous, QoS-aware load balancing in 5G networks, demonstrating superior performance and stability over existing methods in a Python simulation environment.
Contribution
The paper develops a novel PPO-based deep RL approach for load balancing in 5G, explicitly balancing efficiency and stability under mobility and noise, with comprehensive evaluation.
Findings
PPO outperforms rule-based and other learning baselines across key KPIs.
The approach achieves rapid, stable convergence in simulated stress tests.
The method maintains strong generalization as user load increases.
Abstract
Efficient mobility management and load balancing are critical to sustaining Quality of Service (QoS) in dense, highly dynamic 5G radio access networks. We present a deep reinforcement learning framework based on Proximal Policy Optimization (PPO) for autonomous, QoS-aware load balancing implemented end-to-end in a lightweight, pure-Python simulation environment. The control problem is formulated as a Markov Decision Process in which the agent periodically adjusts Cell Individual Offset (CIO) values to steer user-cell associations. A multi-objective reward captures key performance indicators (aggregate throughput, latency, jitter, packet loss rate, Jain's fairness index, and handover count), so the learned policy explicitly balances efficiency and stability under user mobility and noisy observations. The PPO agent uses an actor-critic neural network trained from trajectories generated by…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
