Concentration Network for Reinforcement Learning of Large-Scale Multi-Agent Systems

Qingxu Fu; Tenghai Qiu; Jianqiang Yi; Zhiqiang Pu; Shiguang Wu

arXiv:2203.06416 · cs.AI · April 8, 2022 · 1 citation

TL;DR

This paper introduces ConcNet, a novel concentration network that prioritizes and aggregates entity observations in large-scale multi-agent systems, improving scalability and performance in reinforcement learning tasks.

Contribution

ConcNet is a new concentration network that explicitly scores, ranks, and prunes observed entities based on motivational indices, enhancing efficiency in large-scale multi-agent reinforcement learning.

Findings

01. ConcNet outperforms existing methods on LMAS benchmarks.

02. It demonstrates excellent scalability and flexibility.

03. The concentration policy gradient effectively learns policies from scratch.

Abstract

When dealing with a series of imminent issues, humans can naturally concentrate on a subset of them by prioritizing the issues according to their contributions to motivational indices, e.g., the probability of winning a game. This idea of concentration offers insights into reinforcement learning of sophisticated Large-scale Multi-Agent Systems (LMAS) involving hundreds of agents. In such an LMAS, each agent receives a long series of entity observations at each step, which can overwhelm existing aggregation networks such as graph attention networks and cause inefficiency. In this paper, we propose a concentration network called ConcNet. First, ConcNet scores the observed entities considering several motivational indices, e.g., expected survival time and state value of the agents, and then ranks, prunes, and aggregates the encodings of observed entities to extract…
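The score, rank, prune, and aggregate sequence described in the abstract can be illustrated with a minimal sketch. This is not the paper's ConcNet: the function name `concentrate`, the parameter `k`, and the softmax-weighted sum used for aggregation are all assumptions made for illustration, and the scores are passed in directly rather than produced by learned motivational indices.

```python
import math
from typing import List, Sequence, Tuple


def concentrate(
    encodings: Sequence[Sequence[float]],
    scores: Sequence[float],
    k: int,
) -> Tuple[List[float], List[int]]:
    """Hypothetical score-rank-prune-aggregate step (illustrative only).

    encodings: one feature vector per observed entity.
    scores:    one scalar per entity; in the paper these would come from
               motivational indices, here they are plain inputs (assumption).
    k:         number of entities kept after pruning (assumed hyperparameter).
    """
    if len(encodings) != len(scores):
        raise ValueError("one score per entity encoding is required")
    if k <= 0 or not encodings:
        raise ValueError("need at least one entity and k >= 1")

    # Rank: entity indices by descending score (stable sort keeps input order on ties).
    ranked = sorted(range(len(scores)), key=lambda i: -scores[i])

    # Prune: keep only the k highest-scoring entities.
    kept = ranked[:k]

    # Aggregate: softmax-weighted sum of the kept encodings (an assumed choice).
    top = max(scores[i] for i in kept)
    weights = [math.exp(scores[i] - top) for i in kept]
    total = sum(weights)
    dim = len(encodings[0])
    pooled = [
        sum(w * encodings[i][d] for w, i in zip(weights, kept)) / total
        for d in range(dim)
    ]
    return pooled, kept


if __name__ == "__main__":
    # Three observed entities; the low-scoring third one is pruned when k=2.
    pooled, kept = concentrate(
        encodings=[[1.0, 0.0], [0.0, 1.0], [5.0, 5.0]],
        scores=[2.0, 2.0, -1.0],
        k=2,
    )
    print(kept, pooled)
```

Because pruning fixes the number of aggregated entities at `k`, the cost of the aggregation step in this sketch does not grow with the number of observed entities, only the ranking does.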

Taxonomy

Topics: Reinforcement Learning in Robotics