Safe Multi-agent Reinforcement Learning with Natural Language   Constraints

Ziyan Wang; Meng Fang; Tristan Tomilin; Fei Fang; Yali Du

arXiv:2405.20018·cs.MA·May 31, 2024

Safe Multi-agent Reinforcement Learning with Natural Language Constraints

Ziyan Wang, Meng Fang, Tristan Tomilin, Fei Fang, Yali Du

PDF

Open Access 1 Video

TL;DR

This paper introduces SMALL, a novel method that uses natural language processing to incorporate textual constraints into multi-agent reinforcement learning, making safe and effective policies more accessible without extensive domain expertise.

Contribution

The paper presents a new approach that leverages language models to interpret natural language constraints and integrate them into Safe MARL, reducing the need for pre-defined mathematical constraints.

Findings

01

SMALL achieves comparable rewards to traditional methods.

02

SMALL significantly reduces constraint violations.

03

The LaMaSafe benchmark effectively evaluates natural language constraint adherence.

Abstract

The role of natural language constraints in Safe Multi-agent Reinforcement Learning (MARL) is crucial, yet often overlooked. While Safe MARL has vast potential, especially in fields like robotics and autonomous vehicles, its full potential is limited by the need to define constraints in pre-designed mathematical terms, which requires extensive domain expertise and reinforcement learning knowledge, hindering its broader adoption. To address this limitation and make Safe MARL more accessible and adaptable, we propose a novel approach named Safe Multi-agent Reinforcement Learning with Natural Language constraints (SMALL). Our method leverages fine-tuned language models to interpret and process free-form textual constraints, converting them into semantic embeddings that capture the essence of prohibited states and behaviours. These embeddings are then integrated into the multi-agent policy…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Safe Multi-agent Reinforcement Learning with Natural Language Constraints· underline

Taxonomy

TopicsFuzzy Logic and Control Systems · Multi-Agent Systems and Negotiation · Reinforcement Learning in Robotics