R3DM: Enabling Role Discovery and Diversity Through Dynamics Models in Multi-agent Reinforcement Learning

Harsh Goel; Mohammad Omama; Behdad Chalaki; Vaishnav Tadiparthi; Ehsan Moradi Pari; Sandeep Chinchali

arXiv:2505.24265·cs.MA·May 1, 2026

R3DM: Enabling Role Discovery and Diversity Through Dynamics Models in Multi-agent Reinforcement Learning

Harsh Goel, Mohammad Omama, Behdad Chalaki, Vaishnav Tadiparthi, Ehsan Moradi Pari, Sandeep Chinchali

PDF

1 Repo 1 Video

TL;DR

R3DM introduces a novel role discovery framework in multi-agent reinforcement learning that enhances coordination and diversity by leveraging dynamics models and mutual information maximization.

Contribution

It proposes a new role-based MARL method that learns emergent roles through contrastive learning and dynamics models, improving multi-agent coordination.

Findings

01

R3DM outperforms state-of-the-art MARL methods on SMAC and SMACv2 benchmarks.

02

It increases multi-agent win rates by up to 20%.

03

The approach promotes diversity in agent behaviors through learned intrinsic rewards.

Abstract

Multi-agent reinforcement learning (MARL) has achieved significant progress in large-scale traffic control, autonomous vehicles, and robotics. Drawing inspiration from biological systems where roles naturally emerge to enable coordination, role-based MARL methods have been proposed to enhance cooperation learning for complex tasks. However, existing methods exclusively derive roles from an agent's past experience during training, neglecting their influence on its future trajectories. This paper introduces a key insight: an agent's role should shape its future behavior to enable effective coordination. Hence, we propose Role Discovery and Diversity through Dynamics Models (R3DM), a novel role-based MARL framework that learns emergent roles by maximizing the mutual information between agents' roles, observed trajectories, and expected future behaviors. R3DM optimizes the proposed…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

UTAustin-SwarmLab/R3DM
github

Videos

R3DM: Enabling Role Discovery and Diversity Through Dynamics Models in Multi-agent Reinforcement Learning· slideslive