Verification of indefinite-horizon POMDPs

Alexander Bork; Sebastian Junges; Joost-Pieter Katoen; Tim Quatmann

arXiv:2007.00102·cs.AI·July 2, 2020

Verification of indefinite-horizon POMDPs

Alexander Bork, Sebastian Junges, Joost-Pieter Katoen, Tim Quatmann

PDF

1 Repo

TL;DR

This paper introduces an abstraction-refinement framework for verifying indefinite-horizon POMDPs, improving scalability by extending the Lovejoy-approach to handle policies based on observation histories.

Contribution

It extends the Lovejoy-approach with an abstraction-refinement framework specifically for POMDP verification, enhancing scalability and practical applicability.

Findings

01

Significant scalability improvements demonstrated in experiments

02

Framework effectively handles policies based on observation histories

03

Extends existing verification methods to more complex POMDPs

Abstract

The verification problem in MDPs asks whether, for any policy resolving the nondeterminism, the probability that something bad happens is bounded by some given threshold. This verification problem is often overly pessimistic, as the policies it considers may depend on the complete system state. This paper considers the verification problem for partially observable MDPs, in which the policies make their decisions based on (the history of) the observations emitted by the system. We present an abstraction-refinement framework extending previous instantiations of the Lovejoy-approach. Our experiments show that this framework significantly improves the scalability of the approach.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

moves-rwth/indefinite-horizon-pomdps
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.