Reinforcement Learning-Based Policy Optimisation For Heterogeneous Radio Access

Anup Mishra; \v{C}edomir Stefanovi\'c; Xiuqiang Xu; Petar Popovski; Israel Leyva-Mayorga

arXiv:2506.15273·eess.SP·June 19, 2025

Reinforcement Learning-Based Policy Optimisation For Heterogeneous Radio Access

Anup Mishra, \v{C}edomir Stefanovi\'c, Xiuqiang Xu, Petar Popovski, Israel Leyva-Mayorga

PDF

Open Access

TL;DR

This paper introduces a reinforcement learning approach to optimize resource sharing in heterogeneous wireless networks, improving IoT latency and maintaining broadband throughput and energy efficiency.

Contribution

It proposes a double Q-Learning based policy for IoT transmission optimization in grant-free access, applicable to both RAN slicing and sharing scenarios.

Findings

01

RL policies improve IoT latency performance

02

RAN sharing enhances energy efficiency at low traffic

03

RAN slicing is preferable at high IoT traffic

Abstract

Flexible and efficient wireless resource sharing across heterogeneous services is a key objective for future wireless networks. In this context, we investigate the performance of a system where latency-constrained internet-of-things (IoT) devices coexist with a broadband user. The base station adopts a grant-free access framework to manage resource allocation, either through orthogonal radio access network (RAN) slicing or by allowing shared access between services. For the IoT users, we propose a reinforcement learning (RL) approach based on double Q-Learning (QL) to optimise their repetition-based transmission strategy, allowing them to adapt to varying levels of interference and meet a predefined latency target. We evaluate the system's performance in terms of the cumulative distribution function of IoT users' latency, as well as the broadband user's throughput and energy efficiency…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Wireless Network Optimization · Wireless Networks and Protocols · Advanced MIMO Systems Optimization

MethodsBalanced Selection · Double Q-learning · Q-Learning