Multi-Agent Reinforcement Learning for Joint Cooperative Spectrum   Sensing and Channel Access in Cognitive UAV Networks

Weiheng Jiang; Wanxin Yu; Wenbo Wang; Tiancong Huang

arXiv:2103.08181·cs.NI·February 24, 2022

Multi-Agent Reinforcement Learning for Joint Cooperative Spectrum Sensing and Channel Access in Cognitive UAV Networks

Weiheng Jiang, Wanxin Yu, Wenbo Wang, Tiancong Huang

PDF

Open Access

TL;DR

This paper introduces a multi-agent reinforcement learning framework for distributed spectrum sensing and channel access in cognitive UAV networks, improving efficiency and stability without prior knowledge of primary user activity.

Contribution

It formulates a hybrid cooperative-competitive multi-agent RL problem and proposes a UCB-H and DDQN-based algorithm to solve it, addressing the curse of dimensionality.

Findings

01

Algorithms converge to stable strategies

02

Significant performance improvements over benchmarks

03

Effective handling of primary user activity uncertainty

Abstract

This paper studies the problem of distributed spectrum/channel access for cognitive radio-enabled unmanned aerial vehicles (CUAVs) that overlay upon primary channels. Under the framework of cooperative spectrum sensing and opportunistic transmission, a one-shot optimization problem for channel allocation, aiming to maximize the expected cumulative weighted reward of multiple CUAVs, is formulated. To handle the uncertainty due to the lack of prior knowledge about the primary user activities as well as the lack of the channel-access coordinator, the original problem is cast into a competition and cooperation hybrid multi-agent reinforcement learning (CCH-MARL) problem in the framework of Markov game (MG). Then, a value-iteration-based RL algorithm, which features upper confidence bound-Hoeffding (UCB-H) strategy searching, is proposed by treating each CUAV as an independent learner (IL).…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsCognitive Radio Networks and Spectrum Sensing · Risk and Portfolio Optimization · Age of Information Optimization