A Cooperation Graph Approach for Multiagent Sparse Reward Reinforcement   Learning

Qingxu Fu; Tenghai Qiu; Zhiqiang Pu; Jianqiang Yi; Wanmai Yuan

arXiv:2208.03002·cs.AI·August 8, 2022

A Cooperation Graph Approach for Multiagent Sparse Reward Reinforcement Learning

Qingxu Fu, Tenghai Qiu, Zhiqiang Pu, Jianqiang Yi, Wanmai Yuan

PDF

Open Access 1 Repo

TL;DR

This paper introduces a novel Cooperation Graph framework and a corresponding reinforcement learning algorithm to improve multiagent cooperation efficiency in sparse reward environments, achieving state-of-the-art results.

Contribution

The paper proposes a new Cooperation Graph structure and a CG-MARL algorithm that effectively addresses sparse reward challenges in multiagent reinforcement learning.

Findings

01

CG-MARL outperforms existing methods in benchmark tasks

02

The Cooperation Graph enables implicit cooperation among agents

03

Hierarchical graph control improves learning efficiency

Abstract

Multiagent reinforcement learning (MARL) can solve complex cooperative tasks. However, the efficiency of existing MARL methods relies heavily on well-defined reward functions. Multiagent tasks with sparse reward feedback are especially challenging not only because of the credit distribution problem, but also due to the low probability of obtaining positive reward feedback. In this paper, we design a graph network called Cooperation Graph (CG). The Cooperation Graph is the combination of two simple bipartite graphs, namely, the Agent Clustering subgraph (ACG) and the Cluster Designating subgraph (CDG). Next, based on this novel graph structure, we propose a Cooperation Graph Multiagent Reinforcement Learning (CG-MARL) algorithm, which can efficiently deal with the sparse reward problem in multiagent tasks. In CG-MARL, agents are directly controlled by the Cooperation Graph. And a policy…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

binary-husky/hmp2g
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics