Choices are More Important than Efforts: LLM Enables Efficient   Multi-Agent Exploration

Yun Qu; Boyuan Wang; Yuhang Jiang; Jianzhun Shao; Yixiu Mao; Cheems; Wang; Chang Liu; Xiangyang Ji

arXiv:2410.02511·cs.AI·October 4, 2024

Choices are More Important than Efforts: LLM Enables Efficient Multi-Agent Exploration

Yun Qu, Boyuan Wang, Yuhang Jiang, Jianzhun Shao, Yixiu Mao, Cheems, Wang, Chang Liu, Xiangyang Ji

PDF

Open Access 1 Repo

TL;DR

This paper presents LEMAE, a systematic approach leveraging large language models to guide multi-agent exploration efficiently by focusing on key states, significantly improving performance and reducing redundant efforts in complex reinforcement learning tasks.

Contribution

The paper introduces LEMAE, a novel framework that uses LLMs to identify key states and guide exploration, reducing redundancy and accelerating learning in multi-agent reinforcement learning.

Findings

01

LEMAE outperforms state-of-the-art methods on benchmarks like SMAC and MPE.

02

Achieves up to 10x faster exploration in certain scenarios.

03

Effectively reduces redundant exploration efforts.

Abstract

With expansive state-action spaces, efficient multi-agent exploration remains a longstanding challenge in reinforcement learning. Although pursuing novelty, diversity, or uncertainty attracts increasing attention, redundant efforts brought by exploration without proper guidance choices poses a practical issue for the community. This paper introduces a systematic approach, termed LEMAE, choosing to channel informative task-relevant guidance from a knowledgeable Large Language Model (LLM) for Efficient Multi-Agent Exploration. Specifically, we ground linguistic knowledge from LLM into symbolic key states, that are critical for task fulfillment, in a discriminative manner at low LLM inference costs. To unleash the power of key states, we design Subspace-based Hindsight Intrinsic Reward (SHIR) to guide agents toward key states by increasing reward density. Additionally, we build the Key…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

hijkzzz/pymarl2
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSemantic Web and Ontologies