Safe Multi-agent Reinforcement Learning with Natural Language Constraints
Ziyan Wang, Meng Fang, Tristan Tomilin, Fei Fang, Yali Du

TL;DR
This paper introduces SMALL, a novel method that uses natural language processing to incorporate textual constraints into multi-agent reinforcement learning, making safe and effective policies more accessible without extensive domain expertise.
Contribution
The paper presents a new approach that leverages language models to interpret natural language constraints and integrate them into Safe MARL, reducing the need for pre-defined mathematical constraints.
Findings
SMALL achieves comparable rewards to traditional methods.
SMALL significantly reduces constraint violations.
The LaMaSafe benchmark effectively evaluates natural language constraint adherence.
Abstract
The role of natural language constraints in Safe Multi-agent Reinforcement Learning (MARL) is crucial, yet often overlooked. While Safe MARL has vast potential, especially in fields like robotics and autonomous vehicles, its full potential is limited by the need to define constraints in pre-designed mathematical terms, which requires extensive domain expertise and reinforcement learning knowledge, hindering its broader adoption. To address this limitation and make Safe MARL more accessible and adaptable, we propose a novel approach named Safe Multi-agent Reinforcement Learning with Natural Language constraints (SMALL). Our method leverages fine-tuned language models to interpret and process free-form textual constraints, converting them into semantic embeddings that capture the essence of prohibited states and behaviours. These embeddings are then integrated into the multi-agent policy…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsFuzzy Logic and Control Systems · Multi-Agent Systems and Negotiation · Reinforcement Learning in Robotics
