Multi-Agent Cooperation via Unsupervised Learning of Joint Intentions

Shanqi Liu; Weiwei Liu; Wenzhou Chen; Guanzhong Tian; Yong Liu

arXiv:2307.02200·cs.MA·July 6, 2023

Multi-Agent Cooperation via Unsupervised Learning of Joint Intentions

Shanqi Liu, Weiwei Liu, Wenzhou Chen, Guanzhong Tian, Yong Liu

PDF

Open Access

TL;DR

This paper introduces a novel multi-agent reinforcement learning approach that learns joint intentions in an unsupervised manner, improving cooperation and performance in complex multi-agent tasks.

Contribution

It proposes a hierarchical MARL framework that autonomously learns joint intentions in a latent space, addressing non-monotonic return issues and enhancing interpretability.

Findings

01

Significant performance improvements in StarCraft benchmarks

02

Effective learning of meaningful joint intentions

03

Adaptability to different agent configurations

Abstract

The field of cooperative multi-agent reinforcement learning (MARL) has seen widespread use in addressing complex coordination tasks. While value decomposition methods in MARL have been popular, they have limitations in solving tasks with non-monotonic returns, restricting their general application. Our work highlights the significance of joint intentions in cooperation, which can overcome non-monotonic problems and increase the interpretability of the learning process. To this end, we present a novel MARL method that leverages learnable joint intentions. Our method employs a hierarchical framework consisting of a joint intention policy and a behavior policy to formulate the optimal cooperative policy. The joint intentions are autonomously learned in a latent space through unsupervised learning and enable the method adaptable to different agent configurations. Our results demonstrate…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics