Language Grounded Multi-agent Reinforcement Learning with Human-interpretable Communication
Huao Li, Hossein Nourkhiz Mahjoub, Behdad Chalaki, Vaishnav, Tadiparthi, Kwonjoon Lee, Ehsan Moradi-Pari, Charles Michael Lewis, Katia P, Sycara

TL;DR
This paper introduces a method to align multi-agent reinforcement learning communication with human language, enabling interpretable interactions and zero-shot generalization in teamwork scenarios.
Contribution
The authors propose a novel pipeline that grounds agent communication in human language using LLM-generated synthetic data, improving interpretability and generalization.
Findings
Language grounding maintains task performance.
Accelerates emergence of communication protocols.
Enables zero-shot generalization to unseen teammates.
Abstract
Multi-Agent Reinforcement Learning (MARL) methods have shown promise in enabling agents to learn a shared communication protocol from scratch and accomplish challenging team tasks. However, the learned language is usually not interpretable to humans or other agents not co-trained together, limiting its applicability in ad-hoc teamwork scenarios. In this work, we propose a novel computational pipeline that aligns the communication space between MARL agents with an embedding space of human natural language by grounding agent communications on synthetic data generated by embodied Large Language Models (LLMs) in interactive teamwork scenarios. Our results demonstrate that introducing language grounding not only maintains task performance but also accelerates the emergence of communication. Furthermore, the learned communication protocols exhibit zero-shot generalization capabilities in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsService-Oriented Architecture and Web Services · Multi-Agent Systems and Negotiation · Speech and dialogue systems
