Learning to Advise and Learning from Advice in Cooperative Multi-Agent Reinforcement Learning
Yue Jin, Shuangqing Wei, Jian Yuan, Xudong Zhang

TL;DR
This paper introduces LALA, a hierarchical advice-based framework for multi-agent reinforcement learning that improves coordination and learning efficiency by leveraging spatiotemporal decision structures and multilevel emergence dynamics.
Contribution
It proposes a novel hierarchical advice mechanism with a spatiotemporal neural network and policy discriminator, addressing coordination hierarchy in MARL.
Findings
LALA outperforms baseline methods in learning efficiency.
LALA enhances coordination capabilities in multi-agent systems.
The approach offers a new perspective on analyzing MARL through emergence dynamics.
Abstract
Learning to coordinate is a daunting problem in multi-agent reinforcement learning (MARL). Previous works have explored it from many facets, including cognition between agents, credit assignment, communication, expert demonstration, etc. However, less attention were paid to agents' decision structure and the hierarchy of coordination. In this paper, we explore the spatiotemporal structure of agents' decisions and consider the hierarchy of coordination from the perspective of multilevel emergence dynamics, based on which a novel approach, Learning to Advise and Learning from Advice (LALA), is proposed to improve MARL. Specifically, by distinguishing the hierarchy of coordination, we propose to enhance decision coordination at meso level with an advisor and leverage a policy discriminator to advise agents' learning at micro level. The advisor learns to aggregate decision information in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComplex Systems and Decision Making · Reinforcement Learning in Robotics · Open Source Software Innovations
