Safe Multiagent Coordination via Entropic Exploration

Ayhan Alp Aydeniz; Enrico Marchesini; Robert Loftin; Christopher Amato; and Kagan Tumer

arXiv:2412.20361·cs.MA·August 27, 2025

Safe Multiagent Coordination via Entropic Exploration

Ayhan Alp Aydeniz, Enrico Marchesini, Robert Loftin, Christopher Amato, and Kagan Tumer

PDF

Open Access

TL;DR

This paper introduces E2C, a novel entropic exploration method for safe multiagent reinforcement learning that enhances exploration and safety by maximizing observation entropy, outperforming existing methods in complex tasks.

Contribution

It proposes a new entropic exploration approach for constrained multiagent RL, emphasizing joint team constraints and demonstrating improved safety and performance.

Findings

01

E2C matches or exceeds baseline performance.

02

E2C reduces unsafe behaviors by up to 50%.

03

E2C effectively balances exploration and safety.

Abstract

Many real-world multiagent learning problems involve safety concerns. In these setups, typical safe reinforcement learning algorithms constrain agents' behavior, limiting exploration -- a crucial component for discovering effective cooperative multiagent behaviors. Moreover, the multiagent literature typically models individual constraints for each agent and has yet to investigate the benefits of using joint team constraints. In this work, we analyze these team constraints from a theoretical and practical perspective and propose entropic exploration for constrained multiagent reinforcement learning (E2C) to address the exploration issue. E2C leverages observation entropy maximization to incentivize exploration and facilitate learning safe and effective cooperative behaviors. Experiments across increasingly complex domains show that E2C agents match or surpass common unconstrained and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRobotic Path Planning Algorithms · Reinforcement Learning in Robotics · Distributed Control Multi-Agent Systems