Multi-Environment POMDPs: Discrete Model Uncertainty Under Partial Observability

Eline M. Bovy; Caleb Probine; Marnix Suilen; Ufuk Topcu; Nils Jansen

arXiv:2510.23744·cs.AI·October 29, 2025

Multi-Environment POMDPs: Discrete Model Uncertainty Under Partial Observability

Eline M. Bovy, Caleb Probine, Marnix Suilen, Ufuk Topcu, Nils Jansen

PDF

TL;DR

This paper introduces Multi-Environment POMDPs (ME-POMDPs) to handle discrete model uncertainty across multiple domains, providing algorithms for robust policy computation under partial observability and model disagreement.

Contribution

It generalizes ME-POMDPs to adversarial-belief POMDPs and offers reduction techniques and algorithms for robust policy computation in multi-environment settings.

Findings

01

Algorithms for exact and approximate robust policy computation.

02

Extension of POMDP benchmarks to multi-environment scenarios.

03

Demonstration of policies on standard benchmarks.

Abstract

Multi-environment POMDPs (ME-POMDPs) extend standard POMDPs with discrete model uncertainty. ME-POMDPs represent a finite set of POMDPs that share the same state, action, and observation spaces, but may arbitrarily vary in their transition, observation, and reward models. Such models arise, for instance, when multiple domain experts disagree on how to model a problem. The goal is to find a single policy that is robust against any choice of POMDP within the set, i.e., a policy that maximizes the worst-case reward across all POMDPs. We generalize and expand on existing work in the following way. First, we show that ME-POMDPs can be generalized to POMDPs with sets of initial beliefs, which we call adversarial-belief POMDPs (AB-POMDPs). Second, we show that any arbitrary ME-POMDP can be reduced to a ME-POMDP that only varies in its transition and reward functions or only in its observation…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.