Scalable Safe Multi-Agent Reinforcement Learning for Multi-Agent System

Haikuo Du; Fandi Gou; Yunze Cai

arXiv:2501.13727·cs.MA·April 2, 2025

Scalable Safe Multi-Agent Reinforcement Learning for Multi-Agent System

Haikuo Du, Fandi Gou, Yunze Cai

PDF

Open Access 1 Repo

TL;DR

This paper introduces SS-MARL, a scalable and safe multi-agent reinforcement learning framework that uses graph-based message passing and constrained policy optimization to improve safety and scalability in large multi-agent systems.

Contribution

The paper proposes a novel graph-structured message passing network and constrained policy optimization for scalable and safe multi-agent reinforcement learning.

Findings

01

Outperforms baselines in safety and optimality trade-offs

02

Achieves superior scalability in large agent scenarios

03

Demonstrates effectiveness through simulation experiments

Abstract

Safety and scalability are two critical challenges faced by practical Multi-Agent Systems (MAS). However, existing Multi-Agent Reinforcement Learning (MARL) algorithms that rely solely on reward shaping are ineffective in ensuring safety, and their scalability is rather limited due to the fixed-size network output. To address these issues, we propose a novel framework, Scalable Safe MARL (SS-MARL), to enhance the safety and scalability of MARL methods. Leveraging the inherent graph structure of MAS, we design a multi-layer message passing network to aggregate local observations and communications of varying sizes. Furthermore, we develop a constrained joint policy optimization method in the setting of local observation to improve safety. Simulation experiments demonstrate that SS-MARL achieves a better trade-off between optimality and safety compared to baselines, and its scalability…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

qeneb/ss-marl
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsElevator Systems and Control · Reinforcement Learning in Robotics

MethodsMixing Adam and SGD