Resource Allocation in Uplink NOMA-IoT Networks: A Reinforcement-Learning Approach
Waleed Ahsan, Wenqiang Yi, Zhijin Qin, Yuanwei Liu, and Arumugam, Nallanathan

TL;DR
This paper proposes a reinforcement learning-based resource allocation scheme for uplink NOMA-IoT networks, improving throughput and fairness by dynamically clustering users under varying traffic conditions.
Contribution
It introduces a novel combination of DRL and SARSA-learning algorithms for adaptive resource allocation in NOMA-IoT, addressing traffic variability and fairness.
Findings
DRL and SARSA-learning algorithms outperform traditional methods in throughput.
The proposed schemes have lower complexity with acceptable accuracy.
NOMA-IoT networks outperform orthogonal access in system throughput.
Abstract
Non-orthogonal multiple access (NOMA) exploits the potential of the power domain to enhance the connectivity for the Internet of Things (IoT). Due to time-varying communication channels, dynamic user clustering is a promising method to increase the throughput of NOMA-IoT networks. This paper develops an intelligent resource allocation scheme for uplink NOMA-IoT communications. To maximise the average performance of sum rates, this work designs an efficient optimization approach based on two reinforcement learning algorithms, namely deep reinforcement learning (DRL) and SARSA-learning. For light traffic, SARSA-learning is used to explore the safest resource allocation policy with low cost. For heavy traffic, DRL is used to handle traffic-introduced huge variables. With the aid of the considered approach, this work addresses two main problems of fair resource allocation in NOMA…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
