Constraint-Aware Deep Reinforcement Learning for End-to-End Resource Orchestration in Mobile Networks
Qiang Liu, Nakjung Choi, Tao Han

TL;DR
This paper introduces SafeSlicing, a constraint-aware deep reinforcement learning approach for resource orchestration in 5G network slicing, achieving over 20% resource savings while maintaining SLA compliance.
Contribution
The paper presents a novel CaDRL algorithm with a constraint network for resource management in network slicing, combining offline training and online adaptation.
Findings
Reduces resource usage by over 20%
Maintains SLA compliance in network slices
Prototyped on a real 5G testbed
Abstract
Network slicing is a promising technology that allows mobile network operators to efficiently serve various emerging use cases in 5G. It is challenging to optimize the utilization of network infrastructures while guaranteeing the performance of network slices according to service level agreements (SLAs). To solve this problem, we propose SafeSlicing that introduces a new constraint-aware deep reinforcement learning (CaDRL) algorithm to learn the optimal resource orchestration policy within two steps, i.e., offline training in a simulated environment and online learning with the real network system. On optimizing the resource orchestration, we incorporate the constraints on the statistical performance of slices in the reward function using Lagrangian multipliers, and solve the Lagrangian relaxed problem via a policy network. To satisfy the constraints on the system capacity, we design a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoftware-Defined Networks and 5G · Full-Duplex Wireless Communications · Ferroelectric and Negative Capacitance Devices
