Factored Online Planning in Many-Agent POMDPs

Maris F.L. Galesloot; Thiago D. Simão; Sebastian Junges; Nils Jansen

arXiv:2312.11434 · cs.AI · February 26, 2024 · 1 citation

TL;DR

This paper introduces a scalable online planning approach for multi-agent POMDPs that simultaneously improves value and belief estimation, enabling effective decision-making in systems with many agents.

Contribution

It presents a novel combination of weighted particle filtering, scalable belief approximation, and exploitation of agent locality for multi-agent POMDP planning.

Findings

01. Outperforms state-of-the-art methods in settings with many agents.

02. Remains competitive with existing methods in settings with few agents.

03. Improves the scalability and accuracy of online planning.

Abstract

In centralized multi-agent systems, often modeled as multi-agent partially observable Markov decision processes (MPOMDPs), the action and observation spaces grow exponentially with the number of agents, making the value and belief estimation of single-agent online planning ineffective. Prior work partially tackles value estimation by exploiting the inherent structure of multi-agent settings via so-called coordination graphs. Additionally, belief estimation methods have been improved by incorporating the likelihood of observations into the approximation. However, the challenges of value estimation and belief estimation have only been tackled individually, which prevents existing methods from scaling to settings with many agents. Therefore, we address these challenges simultaneously. First, we introduce weighted particle filtering to a sample-based online planner for MPOMDPs. Second, we…
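
The abstract mentions weighted particle filtering, in which the likelihood of the received observation is folded into the belief approximation. The following is a minimal sketch of a generic likelihood-weighted particle update, not the paper's algorithm. The function names, the `transition` and `obs_likelihood` callables, and the two-state toy model are all assumptions made for illustration.

```python
from typing import Callable, Hashable, List, Sequence, Tuple

State = Hashable


def weighted_belief_update(
    particles: Sequence[State],
    weights: Sequence[float],
    action,
    observation,
    transition: Callable,      # hypothetical: (state, action) -> sampled next state
    obs_likelihood: Callable,  # hypothetical: (observation, next_state, action) -> probability
) -> Tuple[List[State], List[float]]:
    """Propagate each particle, then reweight it by the observation likelihood."""
    next_particles = [transition(s, action) for s in particles]
    next_weights = [
        w * obs_likelihood(observation, s_next, action)
        for w, s_next in zip(weights, next_particles)
    ]
    total = sum(next_weights)
    if total == 0.0:
        # Every particle is inconsistent with the observation (particle deprivation).
        raise ValueError("all particle weights are zero after the update")
    return next_particles, [w / total for w in next_weights]


def belief_of(state: State, particles: Sequence[State], weights: Sequence[float]) -> float:
    """Weighted probability mass that the particle set assigns to `state`."""
    return sum(w for s, w in zip(particles, weights) if s == state)


# Toy model (assumed for illustration): the state never changes, and the
# observation reports the true state with probability 0.8.
def toy_transition(state, action):
    return state


def toy_obs_likelihood(observation, next_state, action):
    return 0.8 if observation == next_state else 0.2


particles = [0, 0, 1, 1]
weights = [0.25, 0.25, 0.25, 0.25]
particles, weights = weighted_belief_update(
    particles, weights, action=None, observation=1,
    transition=toy_transition, obs_likelihood=toy_obs_likelihood,
)
print(belief_of(1, particles, weights))
```

Unlike rejection-style updates, which discard particles whose simulated observation does not match the real one, this scheme keeps every particle and only adjusts its weight. That matters when the joint observation space is large, since exact matches become rare.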

Taxonomy

Topics: Bayesian Modeling and Causal Inference · Data Stream Mining Techniques · Advanced Graph Neural Networks