Hierarchical Reinforcement Learning for Relay Selection and Power Optimization in Two-Hop Cooperative Relay Network
Yuanzhe Geng, Erwu Liu, Rui Wang, and Yiming Liu

TL;DR
This paper introduces a hierarchical reinforcement learning framework for relay selection and power optimization in two-hop cooperative relay networks, effectively reducing outage probability without requiring channel state information.
Contribution
It proposes a novel HRL approach that decomposes relay selection and power allocation, improving convergence speed and outage reduction over traditional DRL methods.
Findings
HRL converges 30 iterations faster than baseline DRL.
HRL reduces outage probability by 5% compared to traditional methods.
The approach does not require prior channel state information.
Abstract
Cooperative communication is an effective approach to improve spectrum utilization. In order to reduce outage probability of communication system, most studies propose various schemes for relay selection and power allocation, which are based on the assumption of channel state information (CSI). However, it is difficult to get an accurate CSI in practice. In this paper, we study the outage probability minimizing problem subjected to a total transmission power constraint in a two-hop cooperative relay network. We use reinforcement learning (RL) methods to learn strategies for relay selection and power allocation, which do not need any prior knowledge of CSI but simply rely on the interaction with communication environment. It is noted that conventional RL methods, including most deep reinforcement learning (DRL) methods, cannot perform well when the search space is too large. Therefore,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCooperative Communication and Network Coding · Advanced MIMO Systems Optimization · Full-Duplex Wireless Communications
