Learning to Coordinate with Anyone

Lei Yuan; Lihe Li; Ziqian Zhang; Feng Chen; Tianyi Zhang; Cong Guan,; Yang Yu; Zhi-Hua Zhou

arXiv:2309.12633·cs.MA·September 25, 2023

Learning to Coordinate with Anyone

Lei Yuan, Lihe Li, Ziqian Zhang, Feng Chen, Tianyi Zhang, Cong Guan,, Yang Yu, Zhi-Hua Zhou

PDF

Open Access

TL;DR

This paper introduces Macop, a continual multi-agent learning algorithm that generates diverse, incompatible teammates to train agents with robust coordination skills capable of handling any teammate in open environments.

Contribution

The paper proposes a novel continual learning approach to generate diverse teammates covering the policy space, enhancing generalization in multi-agent coordination tasks.

Findings

01

Macop produces more diverse training teammates than previous methods.

02

Agents trained with Macop outperform baselines in coordination ability.

03

Strong generalization demonstrated across multiple scenarios.

Abstract

In open multi-agent environments, the agents may encounter unexpected teammates. Classical multi-agent learning approaches train agents that can only coordinate with seen teammates. Recent studies attempted to generate diverse teammates to enhance the generalizable coordination ability, but were restricted by pre-defined teammates. In this work, our aim is to train agents with strong coordination ability by generating teammates that fully cover the teammate policy space, so that agents can coordinate with any teammates. Since the teammate policy space is too huge to be enumerated, we find only dissimilar teammates that are incompatible with controllable agents, which highly reduces the number of teammates that need to be trained with. However, it is hard to determine the number of such incompatible teammates beforehand. We therefore introduce a continual multi-agent learning process, in…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Data Stream Mining Techniques · Domain Adaptation and Few-Shot Learning