Limit Synchronization in Markov Decision Processes
Laurent Doyen, Thierry Massart, Mahsa Shirmohammadi

TL;DR
This paper studies the problem of limit synchronization in Markov decision processes, establishing decidability, complexity bounds, and memory requirements for different winning modes related to probability accumulation in states.
Contribution
It provides the first comprehensive analysis of synchronization modes in MDPs, including complexity classifications and memory bounds for strategies.
Findings
Decidability and PSPACE-completeness for all synchronization modes.
Exponential memory suffices and may be necessary for sure winning strategies.
Infinite or unbounded memory is necessary for almost-sure and limit-sure winning modes.
Abstract
Markov decision processes (MDP) are finite-state systems with both strategic and probabilistic choices. After fixing a strategy, an MDP produces a sequence of probability distributions over states. The sequence is eventually synchronizing if the probability mass accumulates in a single state, possibly in the limit. Precisely, for 0 <= p <= 1 the sequence is p-synchronizing if a probability distribution in the sequence assigns probability at least p to some state, and we distinguish three synchronization modes: (i) sure winning if there exists a strategy that produces a 1-synchronizing sequence; (ii) almost-sure winning if there exists a strategy that produces a sequence that is, for all epsilon > 0, a (1-epsilon)-synchronizing sequence; (iii) limit-sure winning if for all epsilon > 0, there exists a strategy that produces a (1-epsilon)-synchronizing sequence. We consider the problem…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFormal Methods in Verification · semigroups and automata theory · Logic, programming, and type systems
