Enhancing Cooperative Multi-Agent Reinforcement Learning with State Modelling and Adversarial Exploration

Andreas Kontogiannis; Konstantinos Papathanasiou; Yi Shen; Giorgos Stamou; Michael M. Zavlanos; George Vouros

arXiv:2505.05262·cs.LG·June 16, 2025

Enhancing Cooperative Multi-Agent Reinforcement Learning with State Modelling and Adversarial Exploration

Andreas Kontogiannis, Konstantinos Papathanasiou, Yi Shen, Giorgos Stamou, Michael M. Zavlanos, George Vouros

PDF

Open Access 1 Repo

TL;DR

This paper introduces a novel framework and algorithm for cooperative multi-agent reinforcement learning that improves state inference and exploration, leading to better performance in complex tasks.

Contribution

It proposes a new state modelling framework and the MARL SMPE algorithm that enhance agents' belief representations and exploration strategies in partially observable environments.

Findings

01

SMPE outperforms existing MARL algorithms in benchmark tasks.

02

Agents effectively infer meaningful belief states from observations.

03

Adversarial exploration improves discovery of high-value states.

Abstract

Learning to cooperate in distributed partially observable environments with no communication abilities poses significant challenges for multi-agent deep reinforcement learning (MARL). This paper addresses key concerns in this domain, focusing on inferring state representations from individual agent observations and leveraging these representations to enhance agents' exploration and collaborative task execution policies. To this end, we propose a novel state modelling framework for cooperative MARL, where agents infer meaningful belief representations of the non-observable state, with respect to optimizing their own policies, while filtering redundant and less informative joint state information. Building upon this framework, we propose the MARL SMPE algorithm. In SMPE, agents enhance their own policy's discriminative abilities under partial observability, explicitly by incorporating…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

ddaedalus/smpe
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Adversarial Robustness in Machine Learning · Explainable Artificial Intelligence (XAI)