Self-Play Ensemble Q-learning enabled Resource Allocation for Network   Slicing

Shavbo Salehi; Pedro Enrique Iturria-Rivera; Medhat Elsayed; Majid; Bavand; Raimundas Gaigalas; Yigit Ozcan; and Melike Erol-Kantarci

arXiv:2408.10376·cs.NI·August 21, 2024

Self-Play Ensemble Q-learning enabled Resource Allocation for Network Slicing

Shavbo Salehi, Pedro Enrique Iturria-Rivera, Medhat Elsayed, Majid, Bavand, Raimundas Gaigalas, Yigit Ozcan, and Melike Erol-Kantarci

PDF

Open Access

TL;DR

This paper introduces self-play ensemble Q-learning for network slicing in 5G, improving resource allocation by reducing overestimation and enhancing robustness against adversarial attacks, outperforming traditional RL methods.

Contribution

The paper proposes a novel self-play ensemble Q-learning approach that enhances RL-based resource allocation in 5G network slicing, addressing overestimation and robustness issues.

Findings

01

Self-play ensemble Q-learning reduces latency by 21.92%.

02

It improves throughput by 24.22%.

03

It is more robust against malicious users compared to baseline methods.

Abstract

In 5G networks, network slicing has emerged as a pivotal paradigm to address diverse user demands and service requirements. To meet the requirements, reinforcement learning (RL) algorithms have been utilized widely, but this method has the problem of overestimation and exploration-exploitation trade-offs. To tackle these problems, this paper explores the application of self-play ensemble Q-learning, an extended version of the RL-based technique. Self-play ensemble Q-learning utilizes multiple Q-tables with various exploration-exploitation rates leading to different observations for choosing the most suitable action for each state. Moreover, through self-play, each model endeavors to enhance its performance compared to its previous iterations, boosting system efficiency, and decreasing the effect of overestimation. For performance evaluation, we consider three RL-based algorithms;…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSoftware-Defined Networks and 5G · Advanced Computing and Algorithms · Wireless Body Area Networks