Multi-agent deep reinforcement learning (MADRL) meets multi-user MIMO systems
Heunchul Lee, Jaeseong Jeong

TL;DR
This paper introduces a multi-agent deep reinforcement learning approach to optimize precoding in multi-user MIMO systems, achieving near-optimal rate regions by addressing partial observability and phase ambiguity.
Contribution
It presents the first application of MA-DDPG to jointly optimize precoders for the Pareto-boundary in multi-user MIMO interference channels.
Findings
MA-DDPG learns near-optimal precoding strategies.
Phase ambiguity elimination improves training speed.
Achieves Pareto-boundary of rate region.
Abstract
A multi-agent deep reinforcement learning (MADRL) is a promising approach to challenging problems in wireless environments involving multiple decision-makers (or actors) with high-dimensional continuous action space. In this paper, we present a MADRL-based approach that can jointly optimize precoders to achieve the outer-boundary, called pareto-boundary, of the achievable rate region for a multiple-input single-output (MISO) interference channel (IFC). In order to address two main challenges, namely, multiple actors (or agents) with partial observability and multi-dimensional continuous action space in MISO IFC setup, we adopt a multi-agent deep deterministic policy gradient (MA-DDPG) framework in which decentralized actors with partial observability can learn a multi-dimensional continuous policy in a centralized manner with the aid of shared critic with global information. Meanwhile,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMicrowave Engineering and Waveguides · Full-Duplex Wireless Communications · Energy Harvesting in Wireless Networks
