MARLIN: Multi-Agent Reinforcement Learning Guided by Language-Based Inter-Robot Negotiation

Toby Godfrey; William Hunt; Mohammad D. Soorati

arXiv:2410.14383·cs.RO·April 14, 2026

MARLIN: Multi-Agent Reinforcement Learning Guided by Language-Based Inter-Robot Negotiation

Toby Godfrey, William Hunt, Mohammad D. Soorati

PDF

1 Repo

TL;DR

MARLIN integrates language-based negotiation with reinforcement learning to improve multi-robot training safety and efficiency, enabling better early-stage performance and safer exploration.

Contribution

The paper presents a hybrid framework combining language models with reinforcement learning for multi-robot systems, enhancing early training safety and performance.

Findings

01

Hybrid approach outperforms standard RL in early training stages.

02

Language-based negotiation improves safety during initial learning.

03

System achieves higher early performance without sacrificing final results.

Abstract

Multi-agent reinforcement learning is a key method for training multi-robot systems. Through rewarding or punishing robots over a series of episodes according to their performance, they can be trained and then deployed in the real world. However, poorly trained policies can lead to unsafe behaviour during early training stages. We introduce Multi-Agent Reinforcement Learning guided by language-based Inter-robot Negotiation (MARLIN), a hybrid framework in which large language models provide high-level planning before the reinforcement learning policy has learned effective behaviours. Robots use language models to negotiate actions and generate plans that guide policy learning. The system dynamically switches between reinforcement learning and language-model-based negotiation during training, enabling safer and more effective exploration. MARLIN is evaluated using both simulated and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

SooratiLab/MARLIN
github

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.