Reward-Reinforced Reinforcement Learning for Multi-agent Systems
Changgang Zheng, Shufan Yang, Juan Parra-Ullauri, Antonio, Garcia-Dominguez, and Nelly Bencomo

TL;DR
This paper introduces a reward-reinforced generative adversarial network to improve multi-agent reinforcement learning by modeling value distribution, demonstrating resilience and superior performance in practical communication network scenarios.
Contribution
It presents a novel reward-reinforced GAN framework for multi-agent systems that enhances learning efficiency and effectiveness over traditional methods.
Findings
Outperforms conventional reinforcement learning algorithms
Demonstrates resilience in multi-agent environments
Effective in maximizing user connections in communication networks
Abstract
Reinforcement learning algorithms in multi-agent systems deliver highly resilient and adaptable solutions for common problems in telecommunications,aerospace, and industrial robotics. However, achieving an optimal global goal remains a persistent obstacle for collaborative multi-agent systems, where learning affects the behaviour of more than one agent. A number of nonlinear function approximation methods have been proposed for solving the Bellman equation, which describe a recursive format of an optimal policy. However, how to leverage the value distribution based on reinforcement learning, and how to improve the efficiency and efficacy of such systems remain a challenge. In this work, we developed a reward-reinforced generative adversarial network to represent the distribution of the value function, replacing the approximation of Bellman updates. We demonstrated our method is…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Reinforcement Learning in Robotics · Neural Networks and Reservoir Computing
