Collaborative Intelligent Reflecting Surface Networks with Multi-Agent Reinforcement Learning
Jie Zhang, Jun Li, Yijin Zhang, Qingqing Wu, Xiongwei Wu, Feng Shu, Shi Jin, Wen Chen

TL;DR
This paper develops multi-agent reinforcement learning algorithms to optimize intelligent reflecting surface networks with energy harvesting, improving data rates and convergence speed in dynamic wireless environments.
Contribution
It introduces the MAQ framework with Wolpertinger and policy gradient enhancements for efficient IRS phase shift optimization under energy constraints.
Findings
MAQ-WP achieves 10.7% higher data rate than multi-agent DDPG.
MAQ-PG achieves 8.8% higher data rate with reduced complexity.
Both algorithms converge faster in dynamic IRS-assisted networks.
Abstract
Intelligent reflecting surface (IRS) is envisioned to be widely applied in future wireless networks. In this paper, we investigate a multi-user communication system assisted by cooperative IRS devices with the capability of energy harvesting. Aiming to maximize the long-term average achievable system rate, an optimization problem is formulated by jointly designing the transmit beamforming at the base station (BS) and discrete phase shift beamforming at the IRSs, with the constraints on transmit power, user data rate requirement and IRS energy buffer size. Considering time-varying channels and stochastic arrivals of energy harvested by the IRSs, we first formulate the problem as a Markov decision process (MDP) and then develop a novel multi-agent Q-mix (MAQ) framework with two layers to decouple the optimization parameters. The higher layer is for optimizing phase shift resolutions, and…
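To make the notion of discrete phase shifts and phase shift resolution concrete, here is a minimal sketch. It assumes the common model in which a b-bit element chooses among 2^b uniformly spaced phases in [0, 2π) and reflects with unit amplitude; the abstract does not spell out the paper's exact model, and the function names `phase_set`, `quantize`, and `reflection_coeffs` are hypothetical.

```python
import cmath
import math


def phase_set(bits):
    """Return the 2**bits uniformly spaced phase shifts in [0, 2*pi).

    Assumption: uniform spacing, as in the common discrete-IRS model.
    """
    levels = 2 ** bits
    return [2 * math.pi * k / levels for k in range(levels)]


def quantize(theta, bits):
    """Map a continuous phase (radians) to the nearest allowed discrete level."""
    levels = 2 ** bits
    step = 2 * math.pi / levels
    # Wrap into [0, 2*pi), round to the nearest level, and wrap the index
    # so that phases just below 2*pi map back to level 0.
    k = round((theta % (2 * math.pi)) / step) % levels
    return k * step


def reflection_coeffs(thetas):
    """Unit-amplitude reflection coefficients e^{j*theta}, one per element.

    Assumption: lossless reflection (amplitude 1) at every element.
    """
    return [cmath.exp(1j * t) for t in thetas]


if __name__ == "__main__":
    # A 2-bit element can choose among four phases.
    print(phase_set(2))
    # A 1-bit element snaps 3.0 rad to pi.
    print(quantize(3.0, 1))
```

A higher resolution gives finer phase control at the cost of a larger action space, which is the trade-off the higher layer of the framework is described as optimizing.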
Taxonomy
Topics: Advanced Wireless Communication Technologies · Optical Wireless Communication Technologies · Satellite Communication Systems
Methods: Balanced Selection · Dense Connections · Convolution · Batch Normalization · Experience Replay · Weight Decay · Adam · Deep Deterministic Policy Gradient
