Knowing What Not to Do: Leverage Language Model Insights for Action   Space Pruning in Multi-agent Reinforcement Learning

Zhihao Liu; Xianliang Yang; Zichuan Liu; Yifan Xia; Wei Jiang; Yuanyu; Zhang; Lijuan Li; Guoliang Fan; Lei Song; Bian Jiang

arXiv:2405.16854·cs.MA·May 28, 2024·1 cites

Knowing What Not to Do: Leverage Language Model Insights for Action Space Pruning in Multi-agent Reinforcement Learning

Zhihao Liu, Xianliang Yang, Zichuan Liu, Yifan Xia, Wei Jiang, Yuanyu, Zhang, Lijuan Li, Guoliang Fan, Lei Song, Bian Jiang

PDF

Open Access 1 Repo

TL;DR

This paper introduces eSpark, a framework using large language models to automatically generate exploration functions that prune unnecessary actions in multi-agent reinforcement learning, improving scalability and performance across various scenarios.

Contribution

eSpark leverages LLMs to generate exploration functions in a zero-shot manner, enabling autonomous action pruning and performance enhancement without manual prior knowledge.

Findings

01

eSpark outperforms traditional MARL algorithms in 15 scenarios with an average gain of 34.4%.

02

eSpark achieves a 9.9% performance improvement in traffic light control tasks.

03

eSpark manages over 500 agents, improving scalability by 29.7%.

Abstract

Multi-agent reinforcement learning (MARL) is employed to develop autonomous agents that can learn to adopt cooperative or competitive strategies within complex environments. However, the linear increase in the number of agents leads to a combinatorial explosion of the action space, which may result in algorithmic instability, difficulty in convergence, or entrapment in local optima. While researchers have designed a variety of effective algorithms to compress the action space, these methods also introduce new challenges, such as the need for manually designed prior knowledge or reliance on the structure of the problem, which diminishes the applicability of these techniques. In this paper, we introduce Evolutionary action SPAce Reduction with Knowledge (eSpark), an exploration function generation framework driven by large language models (LLMs) to boost exploration and prune unnecessary…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

liuzhihao2022/espark
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Multi-Agent Systems and Negotiation