Action-Graph Policies: Learning Action Co-dependencies in Multi-Agent Reinforcement Learning

Nikunj Gupta; James Zachary Hare; Jesse Milzman; Rajgopal Kannan; Viktor Prasanna

arXiv:2602.17009·cs.LG·February 24, 2026

Action-Graph Policies: Learning Action Co-dependencies in Multi-Agent Reinforcement Learning

Nikunj Gupta, James Zachary Hare, Jesse Milzman, Rajgopal Kannan, Viktor Prasanna

PDF

Open Access

TL;DR

This paper introduces Action Graph Policies (AGP), a novel approach for modeling action dependencies in multi-agent reinforcement learning, leading to more coordinated and optimal joint actions in complex environments.

Contribution

AGP provides a new framework for capturing inter-agent action dependencies, enhancing the expressiveness and coordination capabilities of policies in MARL.

Findings

01

AGP achieves 80-95% success in coordination tasks.

02

AGP outperforms baselines in diverse environments.

03

AGP induces more expressive joint policies.

Abstract

Coordinating actions is the most fundamental form of cooperation in multi-agent reinforcement learning (MARL). Successful decentralized decision-making often depends not only on good individual actions, but on selecting compatible actions across agents to synchronize behavior, avoid conflicts, and satisfy global constraints. In this paper, we propose Action Graph Policies (AGP), that model dependencies among agents' available action choices. It constructs, what we call, \textit{coordination contexts}, that enable agents to condition their decisions on global action dependencies. Theoretically, we show that AGPs induce a strictly more expressive joint policy compared to fully independent policies and can realize coordinated joint actions that are provably more optimal than greedy execution even from centralized value-decomposition methods. Empirically, we show that AGP achieves 80-95\%…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Explainable Artificial Intelligence (XAI) · Advanced Graph Neural Networks