Statistically Model Checking PCTL Specifications on Markov Decision   Processes via Reinforcement Learning

Yu Wang; Nima Roohi; Matthew West; Mahesh Viswanathan; and Geir E.; Dullerud

arXiv:2004.00273·cs.LG·April 23, 2020·1 cites

Statistically Model Checking PCTL Specifications on Markov Decision Processes via Reinforcement Learning

Yu Wang, Nima Roohi, Matthew West, Mahesh Viswanathan, and Geir E., Dullerud

PDF

Open Access

TL;DR

This paper introduces a reinforcement learning-based statistical model checking approach for verifying PCTL specifications on Markov Decision Processes, providing guarantees on error bounds and extending to both bounded and unbounded time horizons.

Contribution

It develops a novel UCB-based Q-learning method for statistical model checking of PCTL on MDPs, with provable error guarantees and extensions to unbounded specifications.

Findings

01

Effective verification of PCTL specifications demonstrated on case studies

02

Guarantees on error bounds for the proposed statistical model checking method

03

Extension of the approach to unbounded-time PCTL specifications

Abstract

Probabilistic Computation Tree Logic (PCTL) is frequently used to formally specify control objectives such as probabilistic reachability and safety. In this work, we focus on model checking PCTL specifications statistically on Markov Decision Processes (MDPs) by sampling, e.g., checking whether there exists a feasible policy such that the probability of reaching certain goal states is greater than a threshold. We use reinforcement learning to search for such a feasible policy for PCTL specifications, and then develop a statistical model checking (SMC) method with provable guarantees on its error. Specifically, we first use upper-confidence-bound (UCB) based Q-learning to design an SMC algorithm for bounded-time PCTL specifications, and then extend this algorithm to unbounded-time specifications by identifying a proper truncation time by checking the PCTL specification and its negation…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsFormal Methods in Verification · Software Reliability and Analysis Research · Safety Systems Engineering in Autonomy

MethodsQ-Learning