Structured Diversification Emergence via Reinforced Organization Control   and Hierarchical Consensus Learning

Wenhao Li; Xiangfeng Wang; Bo Jin; Junjie Sheng; Yun Hua; Hongyuan; Zha

arXiv:2102.04775·cs.LG·February 10, 2021·1 cites

Structured Diversification Emergence via Reinforced Organization Control and Hierarchical Consensus Learning

Wenhao Li, Xiangfeng Wang, Bo Jin, Junjie Sheng, Yun Hua, Hongyuan, Zha

PDF

Open Access

TL;DR

This paper introduces Rochico, a novel multi-agent reinforcement learning framework that enhances cooperation and exploration through structured team formation and hierarchical consensus, outperforming current state-of-the-art methods.

Contribution

The paper proposes Rochico, a new MARL framework combining adaptive grouping, hierarchical consensus, and self-supervised rewards for improved cooperative behavior.

Findings

01

Outperforms SOTA algorithms in exploration efficiency.

02

Enhances cooperation strength in large-scale tasks.

03

Effectively promotes structured diversification emergence.

Abstract

When solving a complex task, humans will spontaneously form teams and to complete different parts of the whole task, respectively. Meanwhile, the cooperation between teammates will improve efficiency. However, for current cooperative MARL methods, the cooperation team is constructed through either heuristics or end-to-end blackbox optimization. In order to improve the efficiency of cooperation and exploration, we propose a structured diversification emergence MARL framework named {\sc{Rochico}} based on reinforced organization control and hierarchical consensus learning. {\sc{Rochico}} first learns an adaptive grouping policy through the organization control module, which is established by independent multi-agent reinforcement learning. Further, the hierarchical consensus module based on the hierarchical intentions with consensus constraint is introduced after team formation.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMobile Crowdsensing and Crowdsourcing · Reinforcement Learning in Robotics · Visual Attention and Saliency Detection