Context-Aware Bayesian Network Actor-Critic Methods for Cooperative   Multi-Agent Reinforcement Learning

Dingyang Chen; Qi Zhang

arXiv:2306.01920·cs.MA·June 6, 2023·1 cites

Context-Aware Bayesian Network Actor-Critic Methods for Cooperative Multi-Agent Reinforcement Learning

Dingyang Chen, Qi Zhang

PDF

Open Access 1 Repo

TL;DR

This paper introduces a Bayesian network approach to model correlated actions in cooperative multi-agent reinforcement learning, providing theoretical justification and practical algorithms that improve coordination and scalability.

Contribution

It proposes a novel Bayesian network framework for action correlation in MARL, with theoretical analysis and algorithms that adaptively learn context-aware policies.

Findings

01

Theoretical proof of convergence to Nash equilibria.

02

Empirical improvements on MARL benchmarks.

03

Dynamic adjustment of policy sparsity enhances performance.

Abstract

Executing actions in a correlated manner is a common strategy for human coordination that often leads to better cooperation, which is also potentially beneficial for cooperative multi-agent reinforcement learning (MARL). However, the recent success of MARL relies heavily on the convenient paradigm of purely decentralized execution, where there is no action correlation among agents for scalability considerations. In this work, we introduce a Bayesian network to inaugurate correlations between agents' action selections in their joint policy. Theoretically, we establish a theoretical justification for why action dependencies are beneficial by deriving the multi-agent policy gradient formula under such a Bayesian network joint policy and proving its global convergence to Nash equilibria under tabular softmax policy parameterization in cooperative Markov games. Further, by equipping existing…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

dchen48/bnpg
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsBayesian Modeling and Causal Inference · Decision-Making and Behavioral Economics · Reinforcement Learning in Robotics

MethodsSoftmax