Certified Policy Verification and Synthesis for MDPs under   Distributional Reach-avoidance Properties

S. Akshay; Krishnendu Chatterjee; Tobias Meggendorfer; {\DJ}or{\dj}e; \v{Z}ikeli\'c

arXiv:2405.04015·cs.AI·May 8, 2024

Certified Policy Verification and Synthesis for MDPs under Distributional Reach-avoidance Properties

S. Akshay, Krishnendu Chatterjee, Tobias Meggendorfer, {\DJ}or{\dj}e, \v{Z}ikeli\'c

PDF

Open Access

TL;DR

This paper introduces methods for verifying and synthesizing policies in Markov Decision Processes with distributional reach-avoidance properties, providing formal certificates to ensure safety and target reachability in probabilistic models.

Contribution

The work presents the novel concept of distributional reach-avoid certificates and automated procedures for their synthesis and verification in MDPs.

Findings

01

Successfully applied to multi-agent robot-swarm models

02

Able to synthesize certified policies and verify existing ones

03

Demonstrates effectiveness on complex non-trivial examples

Abstract

Markov Decision Processes (MDPs) are a classical model for decision making in the presence of uncertainty. Often they are viewed as state transformers with planning objectives defined with respect to paths over MDP states. An increasingly popular alternative is to view them as distribution transformers, giving rise to a sequence of probability distributions over MDP states. For instance, reachability and safety properties in modeling robot swarms or chemical reaction networks are naturally defined in terms of probability distributions over states. Verifying such distributional properties is known to be hard and often beyond the reach of classical state-based verification techniques. In this work, we consider the problems of certified policy (i.e. controller) verification and synthesis in MDPs under distributional reach-avoidance specifications. By certified we mean that, along with a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAuction Theory and Applications

MethodsSparse Evolutionary Training