Strategy Synthesis in POMDPs via Game-Based Abstractions

Leonore Winterer; Sebastian Junges; Ralf Wimmer; Nils Jansen; Ufuk; Topcu; Joost-Pieter Katoen; and Bernd Becker

arXiv:1708.04236·cs.RO·May 28, 2019

Strategy Synthesis in POMDPs via Game-Based Abstractions

Leonore Winterer, Sebastian Junges, Ralf Wimmer, Nils Jansen, Ufuk, Topcu, Joost-Pieter Katoen, and Bernd Becker

PDF

Open Access

TL;DR

This paper introduces an abstraction refinement framework that converts POMDPs into probabilistic games, enabling efficient strategy synthesis with safety guarantees for large planning problems involving partial observability.

Contribution

It presents a novel method to approximate POMDP strategies through game-based abstractions, improving scalability and safety guarantees over existing approaches.

Findings

01

Handles problems several orders-of-magnitude larger than existing solvers.

02

Provides safety guarantees not available in most current POMDP solvers.

03

Uses abstraction refinement to improve strategy quality iteratively.

Abstract

We study synthesis problems with constraints in partially observable Markov decision processes (POMDPs), where the objective is to compute a strategy for an agent that is guaranteed to satisfy certain safety and performance specifications. Verification and strategy synthesis for POMDPs are, however, computationally intractable in general. We alleviate this difficulty by focusing on planning applications and exploiting typical structural properties of such scenarios; for instance, we assume that the agent has the ability to observe its own position inside an environment. We propose an abstraction refinement framework which turns such a POMDP model into a (fully observable) probabilistic two-player game (PG). For the obtained PGs, efficient verification and synthesis tools allow to determine strategies with optimal safety and performance measures, which approximate optimal schedulers on…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsFormal Methods in Verification · Advanced Software Engineering Methodologies · Reinforcement Learning in Robotics