QoS Assurance Mechanism for 5G Network Slicing Based on the Deep Reinforcement Learning PPO Algorithm
Qingyang Li

TL;DR
This paper introduces a deep reinforcement learning-based mechanism using PPO for ensuring quality of service in 5G network slicing, optimizing resource allocation amidst dynamic network loads.
Contribution
It models resource allocation as a constrained Markov decision process and integrates graph attention and LSTM networks for comprehensive QoS optimization.
Findings
Outperforms baseline models in QoS satisfaction rate
Achieves better delay control and resource utilization
Demonstrates stable convergence in experiments
Abstract
With the increasing diversity of 5G service types and the intensifying dynamic fluctuations of network load, achieve differentiated quality of service assurance in a network slicing environment has become a key issue in resource management. To address this problem, this paper proposes a deep reinforcement learning mechanism for 5G network slicing quality of service assurance based on the traditional proximal policy optimization actor-critic framework. First, the slicing resource allocation is modeled as a constrained Markov decision process, jointly considering the collaborative optimization of bandwidth, computing, and wireless resources. Meanwhile, a graph attention network and bidirectional long short-term memory are introduced to extract topological correlations and temporal service features, combined with an adaptive Lagrangian penalty and dynamic reward shaping mechanism, to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
