Cooperative Multi-Agent Assignment over Stochastic Graphs via Constrained Reinforcement Learning

Leopoldo Agorio; Sean Van Alen; Santiago Paternain; Miguel Calvo-Fullana; Juan Andres Bazerque

arXiv:2502.20462·eess.SY·March 3, 2025

TL;DR

This paper introduces a multi-agent reinforcement learning approach for dynamic task coordination over stochastic networks, in which agents adapt their policies in real time without requiring the dual variables to converge.

Contribution

The paper proposes a formulation in which the dual variables are free to cycle rather than converge, enabling scalable, almost surely feasible multi-agent coordination with single-bit communication and bounded estimation error.

Findings

01. Agents achieve almost sure feasibility in dynamic environments

02. The method works with stochastic, time-varying network connectivity

03. Numerical experiments demonstrate successful multi-robot patrols

Abstract

Constrained multi-agent reinforcement learning offers the framework to design scalable and almost surely feasible solutions for teams of agents operating in dynamic environments to carry out conflicting tasks. We address the challenges of multi-agent coordination through an unconventional formulation in which the dual variables are not driven to convergence but are free to cycle, enabling agents to adapt their policies dynamically based on real-time constraint satisfaction levels. The coordination relies on a light single-bit communication protocol over a network with stochastic connectivity. Using this gossiped information, agents update local estimates of the dual variables. Furthermore, we modify the local dual dynamics by introducing a contraction factor, which lets us use finite communication buffers and keep the estimation error bounded. Under this model, we provide theoretical…
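To illustrate why a contraction factor could make a finite communication buffer sufficient, here is a minimal sketch. It is not the paper's algorithm: the update rule, the names `dual_step` and `replay`, and the parameters `eta` (step size), `gamma` (contraction factor), and `target` (required satisfaction rate) are all assumptions made for illustration. Each received bit is taken to say whether a constraint was satisfied in that step.

```python
import random


def dual_step(lam, satisfied, eta=0.1, gamma=0.95, target=0.5):
    """One contracted, projected dual update driven by a single bit.

    Hypothetical rule (an assumption, not taken from the paper):
    the multiplier shrinks by `gamma`, rises when the constraint was
    violated, falls when it was satisfied, and is clipped at zero.
    """
    slack = (1.0 if satisfied else 0.0) - target
    return max(0.0, gamma * lam - eta * slack)


def replay(bits, lam0=0.0, **params):
    """Rebuild a multiplier estimate by replaying a sequence of bits."""
    lam = lam0
    for bit in bits:
        lam = dual_step(lam, bit, **params)
    return lam


if __name__ == "__main__":
    rng = random.Random(0)
    history = [rng.random() < 0.4 for _ in range(500)]

    full = replay(history)             # estimate from the entire history
    buffered = replay(history[-50:])   # estimate from the last 50 bits only

    print("full history:", full)
    print("50-bit buffer:", buffered)
```

In this sketch each step is a `gamma`-contraction in `lam`, so two estimates fed the same last K bits differ by at most `gamma**K` times their initial gap. The multiplier itself stays within `[0, eta * target / (1 - gamma)]` when started in that range, which is why older bits can be discarded at a bounded cost.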
