Greatest Fixed Points of Probabilistic Min/Max Polynomial Equations, and Reachability for Branching Markov Decision Processes
Kousha Etessami, Alistair Stewart, Mihalis Yannakakis

TL;DR
This paper introduces polynomial-time algorithms for analyzing reachability probabilities in Branching Markov Decision Processes (BMDPs) using fixed point computations, enabling efficient approximation and strategy synthesis.
Contribution
It presents the first polynomial-time algorithms for approximating and deciding reachability probabilities in BMDPs via fixed point solutions of probabilistic polynomial systems.
Findings
Algorithms for approximating reachability probabilities within desired precision.
Polynomial-time algorithms for computing epsilon-optimal strategies.
Decidability results for qualitative reachability analysis in BMDPs.
Abstract
We give polynomial time algorithms for quantitative (and qualitative) reachability analysis for Branching Markov Decision Processes (BMDPs). Specifically, given a BMDP, and given an initial population, where the objective of the controller is to maximize (or minimize) the probability of eventually reaching a population that contains an object of a desired (or undesired) type, we give algorithms for approximating the supremum (infimum) reachability probability, within desired precision epsilon > 0, in time polynomial in the encoding size of the BMDP and in log(1/epsilon). We furthermore give P-time algorithms for computing epsilon-optimal strategies for both maximization and minimization of reachability probabilities. We also give P-time algorithms for all associated qualitative analysis problems, namely: deciding whether the optimal (supremum or infimum) reachability probabilities are 0…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
