Approximating the Termination Value of One-Counter MDPs and Stochastic Games
Tom\'a\v{s} Br\'azdil, V\'aclav Bro\v{z}ek, Kousha Etessami, Anton\'in, Ku\v{c}era

TL;DR
This paper proves that the quantitative termination values of one-counter MDPs and stochastic games are computable, enabling approximation within any desired precision and the computation of epsilon-optimal strategies.
Contribution
It establishes the computability of approximation problems for termination values in OC-MDPs and OC-SSGs, a previously open question, and introduces martingale-based techniques for this purpose.
Findings
All quantitative approximation problems are computable.
Epsilon-optimal strategies can be computed for both players.
A novel martingale approach is used to derive bounds.
Abstract
One-counter MDPs (OC-MDPs) and one-counter simple stochastic games (OC-SSGs) are 1-player, and 2-player turn-based zero-sum, stochastic games played on the transition graph of classic one-counter automata (equivalently, pushdown automata with a 1-letter stack alphabet). A key objective for the analysis and verification of these games is the termination objective, where the players aim to maximize (minimize, respectively) the probability of hitting counter value 0, starting at a given control state and given counter value. Recently, we studied qualitative decision problems ("is the optimal termination value = 1?") for OC-MDPs (and OC-SSGs) and showed them to be decidable in P-time (in NP and coNP, respectively). However, quantitative decision and approximation problems ("is the optimal termination value ? p", or "approximate the termination value within epsilon") are far more…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFormal Methods in Verification · semigroups and automata theory · Logic, programming, and type systems
