Learning Heterogeneous Agent Cooperation via Multiagent League Training

Qingxu Fu; Xiaolin Ai; Jianqiang Yi; Tenghai Qiu; Wanmai Yuan,; Zhiqiang Pu

arXiv:2211.11616·cs.LG·May 30, 2023

Learning Heterogeneous Agent Cooperation via Multiagent League Training

Qingxu Fu, Xiaolin Ai, Jianqiang Yi, Tenghai Qiu, Wanmai Yuan,, Zhiqiang Pu

PDF

Open Access 1 Repo

TL;DR

This paper introduces Heterogeneous League Training (HLT), a reinforcement learning algorithm designed to improve cooperation among diverse agents in multiagent systems by maintaining a policy pool and using a hyper-network to enhance behavioral diversity.

Contribution

The paper presents a novel RL algorithm, HLT, that effectively addresses challenges in heterogeneous multiagent systems, including non-stationarity and policy iteration issues, with a new policy league and diversity mechanism.

Findings

01

HLT increases success rates in cooperative heterogeneous tasks.

02

HLT effectively solves the policy version iteration problem.

03

HLT offers a practical method to evaluate learning difficulty of roles.

Abstract

Many multiagent systems in the real world include multiple types of agents with different abilities and functionality. Such heterogeneous multiagent systems have significant practical advantages. However, they also come with challenges compared with homogeneous systems for multiagent reinforcement learning, such as the non-stationary problem and the policy version iteration issue. This work proposes a general-purpose reinforcement learning algorithm named Heterogeneous League Training (HLT) to address heterogeneous multiagent problems. HLT keeps track of a pool of policies that agents have explored during training, gathering a league of heterogeneous policies to facilitate future policy optimization. Moreover, a hyper-network is introduced to increase the diversity of agent behaviors when collaborating with teammates having different levels of cooperation skills. We use heterogeneous…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

binary-husky/unreal-hmp
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics