Reward-Reinforced Reinforcement Learning for Multi-agent Systems

Changgang Zheng; Shufan Yang; Juan Parra-Ullauri; Antonio; Garcia-Dominguez; and Nelly Bencomo

arXiv:2103.12192·cs.MA·May 18, 2021·1 cites

Reward-Reinforced Reinforcement Learning for Multi-agent Systems

Changgang Zheng, Shufan Yang, Juan Parra-Ullauri, Antonio, Garcia-Dominguez, and Nelly Bencomo

PDF

Open Access 1 Repo

TL;DR

This paper introduces a reward-reinforced generative adversarial network to improve multi-agent reinforcement learning by modeling value distribution, demonstrating resilience and superior performance in practical communication network scenarios.

Contribution

It presents a novel reward-reinforced GAN framework for multi-agent systems that enhances learning efficiency and effectiveness over traditional methods.

Findings

01

Outperforms conventional reinforcement learning algorithms

02

Demonstrates resilience in multi-agent environments

03

Effective in maximizing user connections in communication networks

Abstract

Reinforcement learning algorithms in multi-agent systems deliver highly resilient and adaptable solutions for common problems in telecommunications,aerospace, and industrial robotics. However, achieving an optimal global goal remains a persistent obstacle for collaborative multi-agent systems, where learning affects the behaviour of more than one agent. A number of nonlinear function approximation methods have been proposed for solving the Bellman equation, which describe a recursive format of an optimal policy. However, how to leverage the value distribution based on reinforcement learning, and how to improve the efficiency and efficacy of such systems remain a challenge. In this work, we developed a reward-reinforced generative adversarial network to represent the distribution of the value function, replacing the approximation of Bellman updates. We demonstrated our method is…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Changgang-Zheng/History-awareness-Self-adaptive-System-on-Airborne-Base-Station
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Reinforcement Learning in Robotics · Neural Networks and Reservoir Computing