Bridging the Gap between Partially Observable Stochastic Games and   Sparse POMDP Methods

Tyler Becker; Zachary Sunberg

arXiv:2405.18703·cs.GT·October 30, 2024

Bridging the Gap between Partially Observable Stochastic Games and Sparse POMDP Methods

Tyler Becker, Zachary Sunberg

PDF

Open Access

TL;DR

This paper introduces a unified framework combining POMDP techniques and game-theoretic methods to efficiently solve large-scale partially observable stochastic games, enhancing autonomous multi-agent decision-making.

Contribution

It presents a novel approach that unifies belief approximation and equilibrium search, providing a theoretical foundation and empirical validation for scalable POSG solutions.

Findings

01

Effective belief approximation bounds the error in POSG planning.

02

The approach enables handling very large state spaces in multi-agent scenarios.

03

Empirical results demonstrate improved planning in complex environments.

Abstract

Many real-world decision problems involve the interaction of multiple self-interested agents with limited sensing ability. The partially observable stochastic game (POSG) provides a mathematical framework for modeling these problems, however solving a POSG requires difficult reasoning over two critical factors: (1) information revealed by partial observations and (2) decisions other agents make. In the single agent case, partially observable Markov decision process (POMDP) planning can efficiently address partial observability with particle filtering. In the multi-agent case, extensive form game solution methods account for other agent's decisions, but preclude belief approximation. We propose a unifying framework that combines POMDP-inspired state distribution approximation and game-theoretic equilibrium search on information sets. This paper lays a theoretical foundation for the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSimulation Techniques and Applications