Knowing What Not to Do: Leverage Language Model Insights for Action Space Pruning in Multi-agent Reinforcement Learning
Zhihao Liu, Xianliang Yang, Zichuan Liu, Yifan Xia, Wei Jiang, Yuanyu, Zhang, Lijuan Li, Guoliang Fan, Lei Song, Bian Jiang

TL;DR
This paper introduces eSpark, a framework using large language models to automatically generate exploration functions that prune unnecessary actions in multi-agent reinforcement learning, improving scalability and performance across various scenarios.
Contribution
eSpark leverages LLMs to generate exploration functions in a zero-shot manner, enabling autonomous action pruning and performance enhancement without manual prior knowledge.
Findings
eSpark outperforms traditional MARL algorithms in 15 scenarios with an average gain of 34.4%.
eSpark achieves a 9.9% performance improvement in traffic light control tasks.
eSpark manages over 500 agents, improving scalability by 29.7%.
Abstract
Multi-agent reinforcement learning (MARL) is employed to develop autonomous agents that can learn to adopt cooperative or competitive strategies within complex environments. However, the linear increase in the number of agents leads to a combinatorial explosion of the action space, which may result in algorithmic instability, difficulty in convergence, or entrapment in local optima. While researchers have designed a variety of effective algorithms to compress the action space, these methods also introduce new challenges, such as the need for manually designed prior knowledge or reliance on the structure of the problem, which diminishes the applicability of these techniques. In this paper, we introduce Evolutionary action SPAce Reduction with Knowledge (eSpark), an exploration function generation framework driven by large language models (LLMs) to boost exploration and prune unnecessary…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Multi-Agent Systems and Negotiation
