Strategy Synthesis in POMDPs via Game-Based Abstractions
Leonore Winterer, Sebastian Junges, Ralf Wimmer, Nils Jansen, Ufuk, Topcu, Joost-Pieter Katoen, and Bernd Becker

TL;DR
This paper introduces an abstraction refinement framework that converts POMDPs into probabilistic games, enabling efficient strategy synthesis with safety guarantees for large planning problems involving partial observability.
Contribution
It presents a novel method to approximate POMDP strategies through game-based abstractions, improving scalability and safety guarantees over existing approaches.
Findings
Handles problems several orders-of-magnitude larger than existing solvers.
Provides safety guarantees not available in most current POMDP solvers.
Uses abstraction refinement to improve strategy quality iteratively.
Abstract
We study synthesis problems with constraints in partially observable Markov decision processes (POMDPs), where the objective is to compute a strategy for an agent that is guaranteed to satisfy certain safety and performance specifications. Verification and strategy synthesis for POMDPs are, however, computationally intractable in general. We alleviate this difficulty by focusing on planning applications and exploiting typical structural properties of such scenarios; for instance, we assume that the agent has the ability to observe its own position inside an environment. We propose an abstraction refinement framework which turns such a POMDP model into a (fully observable) probabilistic two-player game (PG). For the obtained PGs, efficient verification and synthesis tools allow to determine strategies with optimal safety and performance measures, which approximate optimal schedulers on…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFormal Methods in Verification · Advanced Software Engineering Methodologies · Reinforcement Learning in Robotics
