A Framework for Sequential Planning in Multi-Agent Settings
P. Doshi, P. J. Gmytrasiewicz

TL;DR
This paper extends POMDPs to multi-agent scenarios by incorporating agent models into the state space, enabling belief-based decision making that accounts for other agents' beliefs and types, with convergence properties similar to single-agent POMDPs.
Contribution
It introduces a multi-agent POMDP framework with belief updates over other agents' models, capturing complex interactions while maintaining key properties of traditional POMDPs.
Findings
Belief updates and value functions exhibit properties like convergence and convexity.
Approximate solutions are computable despite the complexity of nested beliefs.
The framework is demonstrated through a simple application domain.
Abstract
This paper extends the framework of partially observable Markov decision processes (POMDPs) to multi-agent settings by incorporating the notion of agent models into the state space. Agents maintain beliefs over physical states of the environment and over models of other agents, and they use Bayesian updates to maintain their beliefs over time. The solutions map belief states to actions. Models of other agents may include their belief states and are related to agent types considered in games of incomplete information. We express the agents autonomy by postulating that their models are not directly manipulable or observable by other agents. We show that important properties of POMDPs, such as convergence of value iteration, the rate of convergence, and piece-wise linearity and convexity of the value functions carry over to our framework. Our approach complements a more traditional…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
