Greatest Fixed Points of Probabilistic Min/Max Polynomial Equations, and   Reachability for Branching Markov Decision Processes

Kousha Etessami; Alistair Stewart; Mihalis Yannakakis

arXiv:1502.05533·cs.CC·April 8, 2016

Greatest Fixed Points of Probabilistic Min/Max Polynomial Equations, and Reachability for Branching Markov Decision Processes

Kousha Etessami, Alistair Stewart, Mihalis Yannakakis

PDF

TL;DR

This paper introduces polynomial-time algorithms for analyzing reachability probabilities in Branching Markov Decision Processes (BMDPs) using fixed point computations, enabling efficient approximation and strategy synthesis.

Contribution

It presents the first polynomial-time algorithms for approximating and deciding reachability probabilities in BMDPs via fixed point solutions of probabilistic polynomial systems.

Findings

01

Algorithms for approximating reachability probabilities within desired precision.

02

Polynomial-time algorithms for computing epsilon-optimal strategies.

03

Decidability results for qualitative reachability analysis in BMDPs.

Abstract

We give polynomial time algorithms for quantitative (and qualitative) reachability analysis for Branching Markov Decision Processes (BMDPs). Specifically, given a BMDP, and given an initial population, where the objective of the controller is to maximize (or minimize) the probability of eventually reaching a population that contains an object of a desired (or undesired) type, we give algorithms for approximating the supremum (infimum) reachability probability, within desired precision epsilon > 0, in time polynomial in the encoding size of the BMDP and in log(1/epsilon). We furthermore give P-time algorithms for computing epsilon-optimal strategies for both maximization and minimization of reachability probabilities. We also give P-time algorithms for all associated qualitative analysis problems, namely: deciding whether the optimal (supremum or infimum) reachability probabilities are 0…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.