QoS Assurance Mechanism for 5G Network Slicing Based on the Deep Reinforcement Learning PPO Algorithm

Qingyang Li

arXiv:2605.03345·cs.NI·May 6, 2026

QoS Assurance Mechanism for 5G Network Slicing Based on the Deep Reinforcement Learning PPO Algorithm

Qingyang Li

PDF

TL;DR

This paper introduces a deep reinforcement learning-based mechanism using PPO for ensuring quality of service in 5G network slicing, optimizing resource allocation amidst dynamic network loads.

Contribution

It models resource allocation as a constrained Markov decision process and integrates graph attention and LSTM networks for comprehensive QoS optimization.

Findings

01

Outperforms baseline models in QoS satisfaction rate

02

Achieves better delay control and resource utilization

03

Demonstrates stable convergence in experiments

Abstract

With the increasing diversity of 5G service types and the intensifying dynamic fluctuations of network load, achieve differentiated quality of service assurance in a network slicing environment has become a key issue in resource management. To address this problem, this paper proposes a deep reinforcement learning mechanism for 5G network slicing quality of service assurance based on the traditional proximal policy optimization actor-critic framework. First, the slicing resource allocation is modeled as a constrained Markov decision process, jointly considering the collaborative optimization of bandwidth, computing, and wireless resources. Meanwhile, a graph attention network and bidirectional long short-term memory are introduced to extract topological correlations and temporal service features, combined with an adaptive Lagrangian penalty and dynamic reward shaping mechanism, to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.