Neighborhood Cognition Consistent Multi-Agent Reinforcement Learning

Hangyu Mao; Wulong Liu; Jianye Hao; Jun Luo; Dong Li; Zhengchao Zhang,; Jun Wang; Zhen Xiao

arXiv:1912.01160·cs.AI·February 11, 2020·6 cites

Neighborhood Cognition Consistent Multi-Agent Reinforcement Learning

Hangyu Mao, Wulong Liu, Jianye Hao, Jun Luo, Dong Li, Zhengchao Zhang,, Jun Wang, Zhen Xiao

PDF

Open Access

TL;DR

This paper introduces neighborhood cognitive consistency into multi-agent reinforcement learning, enhancing cooperation in large-scale multi-agent systems through novel NCC-based algorithms demonstrated on various complex tasks.

Contribution

It proposes a general NCC framework for MARL, integrating cognitive consistency principles to improve cooperation, with specific implementations in deep Q-learning and Actor-Critic methods.

Findings

01

NCC-based methods outperform state-of-the-art MARL approaches.

02

The approach is effective in tasks like packet routing, wifi configuration, and football control.

03

NCC enhances large-scale multi-agent cooperation.

Abstract

Social psychology and real experiences show that cognitive consistency plays an important role to keep human society in order: if people have a more consistent cognition about their environments, they are more likely to achieve better cooperation. Meanwhile, only cognitive consistency within a neighborhood matters because humans only interact directly with their neighbors. Inspired by these observations, we take the first step to introduce \emph{neighborhood cognitive consistency} (NCC) into multi-agent reinforcement learning (MARL). Our NCC design is quite general and can be easily combined with existing MARL methods. As examples, we propose neighborhood cognition consistent deep Q-learning and Actor-Critic to facilitate large-scale multi-agent cooperations. Extensive experiments on several challenging tasks (i.e., packet routing, wifi configuration, and Google football player control)…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Complex Network Analysis Techniques · Mobile Crowdsensing and Crowdsourcing

MethodsQ-Learning