Learning to Advise and Learning from Advice in Cooperative Multi-Agent   Reinforcement Learning

Yue Jin; Shuangqing Wei; Jian Yuan; Xudong Zhang

arXiv:2205.11163·cs.MA·May 24, 2022

Learning to Advise and Learning from Advice in Cooperative Multi-Agent Reinforcement Learning

Yue Jin, Shuangqing Wei, Jian Yuan, Xudong Zhang

PDF

Open Access

TL;DR

This paper introduces LALA, a hierarchical advice-based framework for multi-agent reinforcement learning that improves coordination and learning efficiency by leveraging spatiotemporal decision structures and multilevel emergence dynamics.

Contribution

It proposes a novel hierarchical advice mechanism with a spatiotemporal neural network and policy discriminator, addressing coordination hierarchy in MARL.

Findings

01

LALA outperforms baseline methods in learning efficiency.

02

LALA enhances coordination capabilities in multi-agent systems.

03

The approach offers a new perspective on analyzing MARL through emergence dynamics.

Abstract

Learning to coordinate is a daunting problem in multi-agent reinforcement learning (MARL). Previous works have explored it from many facets, including cognition between agents, credit assignment, communication, expert demonstration, etc. However, less attention were paid to agents' decision structure and the hierarchy of coordination. In this paper, we explore the spatiotemporal structure of agents' decisions and consider the hierarchy of coordination from the perspective of multilevel emergence dynamics, based on which a novel approach, Learning to Advise and Learning from Advice (LALA), is proposed to improve MARL. Specifically, by distinguishing the hierarchy of coordination, we propose to enhance decision coordination at meso level with an advisor and leverage a policy discriminator to advise agents' learning at micro level. The advisor learns to aggregate decision information in…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsComplex Systems and Decision Making · Reinforcement Learning in Robotics · Open Source Software Innovations