Structured Diversification Emergence via Reinforced Organization Control and Hierarchical Consensus Learning
Wenhao Li, Xiangfeng Wang, Bo Jin, Junjie Sheng, Yun Hua, Hongyuan, Zha

TL;DR
This paper introduces Rochico, a novel multi-agent reinforcement learning framework that enhances cooperation and exploration through structured team formation and hierarchical consensus, outperforming current state-of-the-art methods.
Contribution
The paper proposes Rochico, a new MARL framework combining adaptive grouping, hierarchical consensus, and self-supervised rewards for improved cooperative behavior.
Findings
Outperforms SOTA algorithms in exploration efficiency.
Enhances cooperation strength in large-scale tasks.
Effectively promotes structured diversification emergence.
Abstract
When solving a complex task, humans will spontaneously form teams and to complete different parts of the whole task, respectively. Meanwhile, the cooperation between teammates will improve efficiency. However, for current cooperative MARL methods, the cooperation team is constructed through either heuristics or end-to-end blackbox optimization. In order to improve the efficiency of cooperation and exploration, we propose a structured diversification emergence MARL framework named {\sc{Rochico}} based on reinforced organization control and hierarchical consensus learning. {\sc{Rochico}} first learns an adaptive grouping policy through the organization control module, which is established by independent multi-agent reinforcement learning. Further, the hierarchical consensus module based on the hierarchical intentions with consensus constraint is introduced after team formation.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMobile Crowdsensing and Crowdsourcing · Reinforcement Learning in Robotics · Visual Attention and Saliency Detection
