Distributionally Robust Cooperative Multi-Agent Reinforcement Learning via Robust Value Factorization

Chengrui Qu; Christopher Yeh; Kishan Panaganti; Eric Mazumdar; Adam Wierman

arXiv:2602.11437·cs.AI·February 13, 2026

Distributionally Robust Cooperative Multi-Agent Reinforcement Learning via Robust Value Factorization

Chengrui Qu, Christopher Yeh, Kishan Panaganti, Eric Mazumdar, Adam Wierman

PDF

Open Access

TL;DR

This paper introduces a distributionally robust value factorization framework for cooperative multi-agent reinforcement learning, enhancing robustness to environmental uncertainties and improving out-of-distribution performance.

Contribution

It proposes the DrIGM principle and develops robust variants of existing value-factorization architectures that ensure robustness and scalability in real-world uncertain environments.

Findings

01

Improved out-of-distribution performance on SustainGym and StarCraft environments.

02

Robust value factorization methods compatible with existing architectures.

03

Theoretical guarantees for robustness in multi-agent systems.

Abstract

Cooperative multi-agent reinforcement learning (MARL) commonly adopts centralized training with decentralized execution, where value-factorization methods enforce the individual-global-maximum (IGM) principle so that decentralized greedy actions recover the team-optimal joint action. However, the reliability of this recipe in real-world settings remains unreliable due to environmental uncertainties arising from the sim-to-real gap, model mismatch, and system noise. We address this gap by introducing Distributionally robust IGM (DrIGM), a principle that requires each agent's robust greedy action to align with the robust team-optimal joint action. We show that DrIGM holds for a novel definition of robust individual action values, which is compatible with decentralized greedy execution and yields a provable robustness guarantee for the whole system. Building on this foundation, we derive…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Artificial Intelligence in Games · Adversarial Robustness in Machine Learning