Multi-agent Exploration with Sub-state Entropy Estimation

Jian Tao; Yang Zhang; Yangkun Chen; Xiu Li

arXiv:2306.06382 · cs.MA · June 13, 2023 · 1 citation

TL;DR

This paper introduces MESE, a multi-agent exploration method that uses sub-state entropy estimation to promote cooperative exploration. It improves MAPPO performance on StarCraft Multi-Agent Challenge (SMAC) tasks.

Contribution

MESE is an exploration approach that incentivizes cooperation through entropy-based sub-state selection and can be integrated into existing MARL algorithms.

Findings

01 MESE improves MAPPO performance on SMAC tasks.

02 Entropy-based sub-state selection effectively guides cooperative exploration.

03 MESE is compatible with most MARL algorithms.

Abstract

Researchers have integrated exploration techniques into multi-agent reinforcement learning (MARL) algorithms, drawing on their remarkable success in deep reinforcement learning. Nonetheless, exploration in MARL presents a more substantial challenge, as agents need to coordinate their efforts in order to achieve comprehensive state coverage. Agents in this setting can struggle to agree on which kinds of states warrant exploring. We introduce Multi-agent Exploration based on Sub-state Entropy (MESE) to address this limitation. This approach incentivizes agents to explore states cooperatively by directing them to achieve consensus via an extra team reward. The additional reward is calculated from the novelty of the current sub-state that merits cooperative exploration. MESE employs a conditioned entropy approach…
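To make the idea of an extra team reward driven by sub-state novelty concrete, here is a minimal sketch. It is not the paper's method: the abstract above is truncated and does not specify the estimator. The sketch assumes a k-nearest-neighbor distance as a stand-in for an entropy-style novelty score, and the names `select_sub_state`, `knn_novelty`, `team_reward`, and the parameters `dims`, `beta`, and `k` are all hypothetical.

```python
import math
from typing import List, Sequence


def select_sub_state(state: Sequence[float], dims: Sequence[int]) -> List[float]:
    """Keep only the chosen dimensions of the global state (hypothetical helper)."""
    return [state[d] for d in dims]


def knn_novelty(sub_state: Sequence[float], memory: List[List[float]], k: int = 3) -> float:
    """Return log(1 + distance to the k-th nearest stored sub-state).

    Assumption: a particle-based k-NN score stands in for the entropy
    estimate; the source does not confirm this choice.
    """
    if not memory:
        return 0.0  # nothing to compare against yet
    dists = sorted(math.dist(sub_state, m) for m in memory)
    kth = dists[min(k, len(dists)) - 1]
    return math.log(1.0 + kth)


def team_reward(
    env_reward: float,
    state: Sequence[float],
    memory: List[List[float]],
    dims: Sequence[int],
    beta: float = 0.1,
    k: int = 3,
) -> float:
    """Add a shared novelty bonus to the environment reward and record the sub-state."""
    sub = select_sub_state(state, dims)
    bonus = knn_novelty(sub, memory, k)
    memory.append(sub)
    return env_reward + beta * bonus
```

Because every agent receives the same bonus, a rarely visited sub-state raises the reward for the whole team, while revisiting a stored sub-state adds nothing. In practice the shaped reward would replace the environment reward passed to a MARL learner such as MAPPO.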

Taxonomy

Topics: Reinforcement Learning in Robotics · Artificial Intelligence in Games · Evolutionary Algorithms and Applications