Deep Reinforcement Learning for Distributed and Uncoordinated Cognitive   Radios Resource Allocation

Ankita Tondwalkar; Andres Kwasinski

arXiv:2205.13944·cs.LG·May 30, 2022

Deep Reinforcement Learning for Distributed and Uncoordinated Cognitive Radios Resource Allocation

Ankita Tondwalkar, Andres Kwasinski

PDF

Open Access

TL;DR

This paper introduces a distributed deep reinforcement learning method for resource allocation in cognitive radio networks, demonstrating faster convergence and near-optimal policies in multi-agent, uncoordinated environments.

Contribution

It presents a novel distributed deep reinforcement learning algorithm that converges in non-stationary multi-agent environments without requiring coordination.

Findings

01

Achieves faster learning than table-based Q-learning.

02

Finds optimal policies in 99% of cases with sufficient training.

03

Requires less than half the steps of traditional methods.

Abstract

This paper presents a novel deep reinforcement learning-based resource allocation technique for the multi-agent environment presented by a cognitive radio network where the interactions of the agents during learning may lead to a non-stationary environment. The resource allocation technique presented in this work is distributed, not requiring coordination with other agents. It is shown by considering aspects specific to deep reinforcement learning that the presented algorithm converges in an arbitrarily long time to equilibrium policies in a non-stationary multi-agent environment that results from the uncoordinated dynamic interaction between radios through the shared wireless environment. Simulation results show that the presented technique achieves a faster learning performance compared to an equivalent table-based Q-learning algorithm and is able to find the optimal policy in 99% of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsCognitive Radio Networks and Spectrum Sensing · Full-Duplex Wireless Communications · Energy Harvesting in Wireless Networks

MethodsQ-Learning