Policy Evaluation in Decentralized POMDPs with Belief Sharing

Mert Kayaalp; Fatima Ghadieh; Ali H. Sayed

arXiv:2302.04151 · cs.LG · May 17, 2023

TL;DR

This paper introduces a decentralized belief sharing method for cooperative policy evaluation in multi-agent systems with noisy observations, enabling agents to approximate centralized performance through local interactions.

Contribution

It proposes a novel decentralized belief formation strategy that facilitates information diffusion and parameter convergence in multi-agent POMDPs with limited communication.

Findings

01. Belief sharing improves policy evaluation accuracy.
02. Agents' value function parameters stay close to those of the centralized baseline.
03. The method is effective in multi-sensor target tracking simulations.

Abstract

Most works on multi-agent reinforcement learning focus on scenarios where the state of the environment is fully observable. In this work, we consider a cooperative policy evaluation task in which agents are not assumed to observe the environment state directly. Instead, agents can only have access to noisy observations and to belief vectors. It is well-known that finding global posterior distributions under multi-agent settings is generally NP-hard. As a remedy, we propose a fully decentralized belief forming strategy that relies on individual updates and on localized interactions over a communication network. In addition to the exchange of the beliefs, agents exploit the communication network by exchanging value function parameter estimates as well. We analytically show that the proposed strategy allows information to diffuse over the network, which in turn allows the agents'…
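The abstract as shown here is truncated and does not state the exact update rule, so the following is only a minimal sketch of what "individual updates" followed by "localized interactions" could look like. It assumes a finite state space, a known transition matrix, a local Bayesian update per agent, and a geometric-averaging combination step with a column-stochastic weight matrix. All function names, the combination rule, and the numbers are illustrative assumptions, not taken from the paper or its repository. The exchange of value function parameter estimates is not shown.

```python
import numpy as np

def local_update(beliefs, transition, likelihoods):
    """Individual step (assumed form): predict with the transition model,
    then weight by each agent's own observation likelihood.

    beliefs:     (K, S) array, one belief vector per agent
    transition:  (S, S) array, transition[s, s2] = P(next = s2 | current = s)
    likelihoods: (K, S) array, likelihood of agent k's observation per state
    """
    predicted = beliefs @ transition            # time evolution of each belief
    posterior = predicted * likelihoods         # fold in the local observation
    return posterior / posterior.sum(axis=1, keepdims=True)

def combine(beliefs, weights):
    """Localized interaction (assumed form): geometric averaging of neighbors'
    beliefs. weights[l, k] is the weight agent k assigns to agent l; each
    column sums to 1, and a zero entry means "not a neighbor".
    """
    log_b = np.log(np.clip(beliefs, 1e-300, None))   # avoid log(0)
    mixed = weights.T @ log_b                        # row k: sum_l w[l, k] * log b_l
    mixed -= mixed.max(axis=1, keepdims=True)        # numerical stability
    out = np.exp(mixed)
    return out / out.sum(axis=1, keepdims=True)

def belief_sharing_step(beliefs, transition, likelihoods, weights):
    """One round: local update, then combination with neighbors."""
    return combine(local_update(beliefs, transition, likelihoods), weights)

# Hypothetical example: 3 agents, 2 hidden states, fully connected network.
transition = np.array([[0.9, 0.1], [0.1, 0.9]])
likelihoods = np.array([[0.8, 0.2], [0.6, 0.4], [0.5, 0.5]])
weights = np.full((3, 3), 1.0 / 3.0)
beliefs = belief_sharing_step(np.full((3, 2), 0.5), transition, likelihoods, weights)
```

With uniform weights on a fully connected network, every agent ends the round with the same belief, which illustrates how local exchanges let information from all sensors spread. On a sparser network the weight matrix would contain zeros and agreement would only build up over several rounds.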

Code & Models

Repositories

asl-epfl/decpomdp_policy_evaluation_w-belief_sharing (Official)

Taxonomy

Topics: Distributed Sensor Networks and Detection Algorithms