Language-Driven Coordination and Learning in Multi-Agent Simulation Environments

Zhengyang Li; Sawyer Campos; Nana Wang

arXiv:2506.04251·cs.AI·November 4, 2025

Language-Driven Coordination and Learning in Multi-Agent Simulation Environments

Zhengyang Li, Sawyer Campos, Nana Wang

PDF

Open Access

TL;DR

This paper presents LLM-MARL, a framework integrating large language models into multi-agent reinforcement learning to improve coordination, communication, and generalization in simulated environments, demonstrating significant performance gains.

Contribution

It introduces a novel modular framework that combines LLMs with MARL, enabling dynamic subgoal generation, symbolic messaging, and episodic memory for better multi-agent cooperation.

Findings

01

Improved win rates and coordination scores over baseline methods.

02

Significant zero-shot generalization in complex environments.

03

Emergent behaviors like role specialization and strategic communication.

Abstract

This paper introduces LLM-MARL, a unified framework that incorporates large language models (LLMs) into multi-agent reinforcement learning (MARL) to enhance coordination, communication, and generalization in simulated game environments. The framework features three modular components of Coordinator, Communicator, and Memory, which dynamically generate subgoals, facilitate symbolic inter-agent messaging, and support episodic recall. Training combines PPO with a language-conditioned loss and LLM query gating. LLM-MARL is evaluated in Google Research Football, MAgent Battle, and StarCraft II. Results show consistent improvements over MAPPO and QMIX in win rate, coordination score, and zero-shot generalization. Ablation studies demonstrate that subgoal generation and language-based messaging each contribute significantly to performance gains. Qualitative analysis reveals emergent behaviors…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Artificial Intelligence in Games · Multi-Agent Systems and Negotiation

MethodsEntropy Regularization · Proximal Policy Optimization