Permissive Supervisor Synthesis for Markov Decision Processes through   Learning

Bo Wu; Xiaobin Zhang; Hai Lin

arXiv:1703.07351·cs.LO·March 23, 2017·2 cites

Permissive Supervisor Synthesis for Markov Decision Processes through Learning

Bo Wu, Xiaobin Zhang, Hai Lin

PDF

Open Access

TL;DR

This paper introduces a learning-based framework for synthesizing permissive supervisors in Markov Decision Processes, enabling distributed control in probabilistic systems while avoiding state space explosion.

Contribution

It presents a novel, iterative supervisor synthesis method combining learning and compositional model checking for probabilistic systems.

Findings

01

Framework guarantees termination and correctness.

02

Avoids state space explosion through compositional reasoning.

03

Supports distributed supervisor synthesis for MDPs.

Abstract

This paper considers the permissive supervisor synthesis for probabilistic systems modeled as Markov Decision Processes (MDP). Such systems are prevalent in power grids, transportation networks, communication networks and robotics. Unlike centralized planning and optimization based planning, we propose a novel supervisor synthesis framework based on learning and compositional model checking to generate permissive local supervisors in a distributed manner. With the recent advance in assume-guarantee reasoning verification for probabilistic systems, building the composed system can be avoided to alleviate the state space explosion and our framework learn the supervisors iteratively based on the counterexamples from verification. Our approach is guaranteed to terminate in finite steps and to be correct.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsFormal Methods in Verification · Advanced Software Engineering Methodologies · Software Reliability and Analysis Research