A Reliable Reinforcement Learning for Resource Allocation in Uplink   NOMA-URLLC Networks

Waleed Ahsan; Wenqiang Yi; Yuanwei Liu; and Arumugam Nallanathan

arXiv:2201.06027·cs.IT·January 19, 2022

A Reliable Reinforcement Learning for Resource Allocation in Uplink NOMA-URLLC Networks

Waleed Ahsan, Wenqiang Yi, Yuanwei Liu, and Arumugam Nallanathan

PDF

Open Access

TL;DR

This paper introduces a deep SARSA(λ) reinforcement learning approach to optimize uplink resource allocation in NOMA-URLLC networks, improving reliability and reducing decoding errors in dynamic environments.

Contribution

It presents a novel deep SARSA(λ) algorithm tailored for resource sharing in NOMA-URLLC, addressing user clustering, feedback, and optimal allocation challenges.

Findings

01

Converges within 200 episodes to low mean error

02

Outperforms traditional Q-learning in decoding error probability

03

Efficient feedback system enhances long-term learning

Abstract

In this paper, we propose a deep state-action-reward-state-action (SARSA) $λ$ learning approach for optimising the uplink resource allocation in non-orthogonal multiple access (NOMA) aided ultra-reliable low-latency communication (URLLC). To reduce the mean decoding error probability in time-varying network environments, this work designs a reliable learning algorithm for providing a long-term resource allocation, where the reward feedback is based on the instantaneous network performance. With the aid of the proposed algorithm, this paper addresses three main challenges of the reliable resource sharing in NOMA-URLLC networks: 1) user clustering; 2) Instantaneous feedback system; and 3) Optimal resource allocation. All of these designs interact with the considered communication environment. Lastly, we compare the performance of the proposed algorithm with conventional Q-learning…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAge of Information Optimization · Advanced Wireless Communication Technologies · Wireless Body Area Networks