Scenario-Based Verification of Uncertain MDPs
Murat Cubuktepe, Nils Jansen, Sebastian Junges, Joost-Pieter Katoen,, Ufuk Topcu

TL;DR
This paper introduces a scenario-based method for verifying uncertain Markov decision processes with unknown parameter distributions, enabling probability estimation of satisfying temporal logic specifications through convex optimization.
Contribution
It proposes a novel scenario optimization approach that provides high-confidence probability bounds for uncertain MDPs without dependence on state or parameter space size.
Findings
Few thousand samples suffice for high-confidence bounds
Method is scalable to large benchmarks
Provides probabilistic guarantees for verification
Abstract
We consider Markov decision processes (MDPs) in which the transition probabilities and rewards belong to an uncertainty set parametrized by a collection of random variables. The probability distributions for these random parameters are unknown. The problem is to compute the probability to satisfy a temporal logic specification within any MDP that corresponds to a sample from these unknown distributions. In general, this problem is undecidable, and we resort to techniques from so-called scenario optimization. Based on a finite number of samples of the uncertain parameters, each of which induces an MDP, the proposed method estimates the probability of satisfying the specification by solving a finite-dimensional convex optimization problem. The number of samples required to obtain a high confidence on this estimate is independent from the number of states and the number of random…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFormal Methods in Verification
