Safe and Accelerated Deep Reinforcement Learning-based O-RAN Slicing: A Hybrid Transfer Learning Approach
Ahmad M. Nagib, Hatem Abou-Zeid, and Hossam S. Hassanein

TL;DR
This paper introduces a hybrid transfer learning approach to improve the safety and speed of deep reinforcement learning algorithms for O-RAN slicing, addressing convergence and stability issues in practical deployments.
Contribution
It proposes a novel hybrid transfer learning method combining policy reuse and distillation to enhance DRL training for O-RAN slicing, ensuring faster and more stable convergence.
Findings
At least 7.7% improvement in initial reward
20.7% increase in converged scenarios
64.6% reduction in reward variance
Abstract
The open radio access network (O-RAN) architecture supports intelligent network control algorithms as one of its core capabilities. Data-driven applications incorporate such algorithms to optimize radio access network (RAN) functions via RAN intelligent controllers (RICs). Deep reinforcement learning (DRL) algorithms are among the main approaches adopted in the O-RAN literature to solve dynamic radio resource management problems. However, despite the benefits introduced by the O-RAN RICs, the practical adoption of DRL algorithms in real network deployments falls behind. This is primarily due to the slow convergence and unstable performance exhibited by DRL agents upon deployment and when encountering previously unseen network conditions. In this paper, we address these challenges by proposing transfer learning (TL) as a core component of the training and deployment workflows for the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFull-Duplex Wireless Communications · Software-Defined Networks and 5G · Energy Harvesting in Wireless Networks
