Multi-Agent Reinforcement Learning for Energy Harvesting Two-Hop Communications with a Partially Observable State
Andrea Ortiz, Hussein Al-Shatri, Tobias Weber, Anja Klein

TL;DR
This paper develops a multi-agent reinforcement learning approach for energy harvesting two-hop communication systems with partial observability, improving throughput through cooperative strategies and channel prediction.
Contribution
It introduces a novel multi-agent RL algorithm for distributed transmission policy optimization in partially observable EH relay networks, with convergence guarantees.
Findings
The proposed algorithm outperforms non-cooperative methods in throughput.
Channel prediction with Kalman filters improves transmission decisions.
Cooperation reduces the impact of partial observability on system performance.
Abstract
We consider an energy harvesting (EH) transmitter communicating with a receiver through an EH relay. The harvested energy is used for data transmission, including the circuit energy consumption. As in practical scenarios, the system state, comprised by the harvested energy, battery levels, data buffer levels, and channel gains, is only partially observable by the EH nodes. Moreover, the EH nodes have only outdated knowledge regarding the channel gains for their own transmit channels. Our goal is to find distributed transmission policies aiming at maximizing the throughput. A channel predictor based on a Kalman filter is implemented in each EH node to estimate the current channel gain for its own channel. Furthermore, to overcome the partial observability of the system state, the EH nodes cooperate with each other to obtain information about their parameters during a signaling phase. We…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEnergy Harvesting in Wireless Networks · Advanced MIMO Systems Optimization · Wireless Power Transfer Systems
