Language Grounded Multi-agent Reinforcement Learning with   Human-interpretable Communication

Huao Li; Hossein Nourkhiz Mahjoub; Behdad Chalaki; Vaishnav; Tadiparthi; Kwonjoon Lee; Ehsan Moradi-Pari; Charles Michael Lewis; Katia P; Sycara

arXiv:2409.17348·cs.MA·November 26, 2024

Language Grounded Multi-agent Reinforcement Learning with Human-interpretable Communication

Huao Li, Hossein Nourkhiz Mahjoub, Behdad Chalaki, Vaishnav, Tadiparthi, Kwonjoon Lee, Ehsan Moradi-Pari, Charles Michael Lewis, Katia P, Sycara

PDF

Open Access 1 Video

TL;DR

This paper introduces a method to align multi-agent reinforcement learning communication with human language, enabling interpretable interactions and zero-shot generalization in teamwork scenarios.

Contribution

The authors propose a novel pipeline that grounds agent communication in human language using LLM-generated synthetic data, improving interpretability and generalization.

Findings

01

Language grounding maintains task performance.

02

Accelerates emergence of communication protocols.

03

Enables zero-shot generalization to unseen teammates.

Abstract

Multi-Agent Reinforcement Learning (MARL) methods have shown promise in enabling agents to learn a shared communication protocol from scratch and accomplish challenging team tasks. However, the learned language is usually not interpretable to humans or other agents not co-trained together, limiting its applicability in ad-hoc teamwork scenarios. In this work, we propose a novel computational pipeline that aligns the communication space between MARL agents with an embedding space of human natural language by grounding agent communications on synthetic data generated by embodied Large Language Models (LLMs) in interactive teamwork scenarios. Our results demonstrate that introducing language grounding not only maintains task performance but also accelerates the emergence of communication. Furthermore, the learned communication protocols exhibit zero-shot generalization capabilities in…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Language Grounded Multi-agent Reinforcement Learning with Human-interpretable Communication· slideslive

Taxonomy

TopicsService-Oriented Architecture and Web Services · Multi-Agent Systems and Negotiation · Speech and dialogue systems