Deep Reinforcement Learning for Distributed and Uncoordinated Cognitive Radios Resource Allocation
Ankita Tondwalkar, Andres Kwasinski

TL;DR
This paper introduces a distributed deep reinforcement learning method for resource allocation in cognitive radio networks, demonstrating faster convergence and near-optimal policies in multi-agent, uncoordinated environments.
Contribution
It presents a novel distributed deep reinforcement learning algorithm that converges in non-stationary multi-agent environments without requiring coordination.
Findings
Achieves faster learning than table-based Q-learning.
Finds optimal policies in 99% of cases with sufficient training.
Requires less than half the steps of traditional methods.
Abstract
This paper presents a novel deep reinforcement learning-based resource allocation technique for the multi-agent environment presented by a cognitive radio network where the interactions of the agents during learning may lead to a non-stationary environment. The resource allocation technique presented in this work is distributed, not requiring coordination with other agents. It is shown by considering aspects specific to deep reinforcement learning that the presented algorithm converges in an arbitrarily long time to equilibrium policies in a non-stationary multi-agent environment that results from the uncoordinated dynamic interaction between radios through the shared wireless environment. Simulation results show that the presented technique achieves a faster learning performance compared to an equivalent table-based Q-learning algorithm and is able to find the optimal policy in 99% of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCognitive Radio Networks and Spectrum Sensing · Full-Duplex Wireless Communications · Energy Harvesting in Wireless Networks
MethodsQ-Learning
