AIR: Unifying Individual and Collective Exploration in Cooperative   Multi-Agent Reinforcement Learning

Guangchong Zhou; Zeren Zhang; Guoliang Fan

arXiv:2412.15700·cs.AI·December 31, 2024

AIR: Unifying Individual and Collective Exploration in Cooperative Multi-Agent Reinforcement Learning

Guangchong Zhou, Zeren Zhang, Guoliang Fan

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces AIR, a novel adaptive exploration method for cooperative multi-agent reinforcement learning that unifies individual and collective exploration, improving training efficiency and effectiveness.

Contribution

AIR combines adversarial identity recognition with adaptive exploration, enabling simultaneous individual and collective exploration without additional structural complexity.

Findings

01

AIR enhances exploration efficiency in MARL tasks

02

AIR improves training effectiveness across various environments

03

Theoretical proof supports AIR's dual exploration capabilities

Abstract

Exploration in cooperative multi-agent reinforcement learning (MARL) remains challenging for value-based agents due to the absence of an explicit policy. Existing approaches include individual exploration based on uncertainty towards the system and collective exploration through behavioral diversity among agents. However, the introduction of additional structures often leads to reduced training efficiency and infeasible integration of these methods. In this paper, we propose Adaptive exploration via Identity Recognition~(AIR), which consists of two adversarial components: a classifier that recognizes agent identities from their trajectories, and an action selector that adaptively adjusts the mode and degree of exploration. We theoretically prove that AIR can facilitate both individual and collective exploration during training, and experiments also demonstrate the efficiency and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

jugg1er/air
pytorchOfficial

Videos

AIR: Unifying Individual and Collective Exploration in Cooperative Multi-Agent Reinforcement Learning· underline

Taxonomy

TopicsReinforcement Learning in Robotics