Verification of general Markov decision processes by approximate   similarity relations and policy refinement

S. Haesaert; S. Esmaeil Zadeh Soudjani; A. Abate

arXiv:1605.09557·cs.SY·June 1, 2016·1 cites

Verification of general Markov decision processes by approximate similarity relations and policy refinement

S. Haesaert, S. Esmaeil Zadeh Soudjani, A. Abate

PDF

Open Access

TL;DR

This paper introduces new approximate similarity relations for general Markov decision processes with uncountably-infinite states, enabling effective verification and control refinement through metric-based comparisons.

Contribution

It develops novel probabilistic similarity relations based on metrics, facilitating policy synthesis and control refinement for complex MDPs with infinite states.

Findings

01

New similarity relations enable verification of complex MDPs.

02

Relations allow trade-offs between probability deviations and output distances.

03

Effective policy refinement from abstract models demonstrated.

Abstract

In this work we introduce new approximate similarity relations that are shown to be key for policy (or control) synthesis over general Markov decision processes. The models of interest are discrete-time Markov decision processes, endowed with uncountably-infinite state spaces and metric output (or observation) spaces. The new relations, underpinned by the use of metrics, allow in particular for a useful trade-off between deviations over probability distributions on states, and distances between model outputs. We show that the new probabilistic similarity relations, inspired by a notion of simulation developed for finite-state models, can be effectively employed over general Markov decision processes for verification purposes, and specifically for control refinement from abstract models.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsFormal Methods in Verification · Petri Nets in System Modeling · Machine Learning and Algorithms