Action-Graph Policies: Learning Action Co-dependencies in Multi-Agent Reinforcement Learning
Nikunj Gupta, James Zachary Hare, Jesse Milzman, Rajgopal Kannan, Viktor Prasanna

TL;DR
This paper introduces Action Graph Policies (AGP), a novel approach for modeling action dependencies in multi-agent reinforcement learning, leading to more coordinated and optimal joint actions in complex environments.
Contribution
AGP provides a new framework for capturing inter-agent action dependencies, enhancing the expressiveness and coordination capabilities of policies in MARL.
Findings
AGP achieves 80-95% success in coordination tasks.
AGP outperforms baselines in diverse environments.
AGP induces more expressive joint policies.
Abstract
Coordinating actions is the most fundamental form of cooperation in multi-agent reinforcement learning (MARL). Successful decentralized decision-making often depends not only on good individual actions, but on selecting compatible actions across agents to synchronize behavior, avoid conflicts, and satisfy global constraints. In this paper, we propose Action Graph Policies (AGP), that model dependencies among agents' available action choices. It constructs, what we call, \textit{coordination contexts}, that enable agents to condition their decisions on global action dependencies. Theoretically, we show that AGPs induce a strictly more expressive joint policy compared to fully independent policies and can realize coordinated joint actions that are provably more optimal than greedy execution even from centralized value-decomposition methods. Empirically, we show that AGP achieves 80-95\%…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Explainable Artificial Intelligence (XAI) · Advanced Graph Neural Networks
