Multi-Power Level $Q$-Learning Algorithm for Random Access in NOMA mMTC Systems
Giovanni Maciel Ferreira Silva, Taufik Abr\~ao

TL;DR
This paper introduces a multi-power level Q-learning algorithm for NOMA-based mMTC systems, improving resource allocation efficiency by leveraging power diversity to reduce collisions and enhance throughput.
Contribution
It proposes a novel multi-power level Q-learning algorithm that utilizes NOMA to increase device capacity and performance in massive random access scenarios.
Findings
Higher number of power levels (around eight) yields better performance.
The proposed MPL-QL outperforms recent QL-based algorithms in throughput.
The algorithm achieves a good balance between complexity and performance.
Abstract
The massive machine-type communications (mMTC) service will be part of new services planned to integrate the fifth generation of wireless communication (B5G). In mMTC, thousands of devices sporadically access available resource blocks on the network. In this scenario, the massive random access (RA) problem arises when two or more devices collide when selecting the same resource block. There are several techniques to deal with this problem. One of them deploys -learning (QL), in which devices store in their -table the rewards sent by the central node that indicate the quality of the transmission performed. The device learns the best resource blocks to select and transmit to avoid collisions. We propose a multi-power level QL (MPL-QL) algorithm that uses non-orthogonal multiple access (NOMA) transmit scheme to generate transmission power diversity and allow {accommodate} more than…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
Methodstravel james
