Solving simple stochastic games with few coin toss positions
Rasmus Ibsen-Jensen, Peter Bro Miltersen

TL;DR
This paper introduces an improved algorithm for solving simple stochastic games with few coin toss positions, combining value iteration and retrograde analysis to achieve better time complexity.
Contribution
It presents a novel algorithm that improves the time bounds for solving simple stochastic games with limited coin toss positions.
Findings
Achieves a time bound of O(r 2^r (r log r + n))
Uses extremal combinatorics for worst-case analysis
Simplifies the approach while improving efficiency
Abstract
Gimbert and Horn gave an algorithm for solving simple stochastic games with running time O(r! n) where n is the number of positions of the simple stochastic game and r is the number of its coin toss positions. Chatterjee et al. pointed out that a variant of strategy iteration can be implemented to solve this problem in time 4^r r^{O(1)} n^{O(1)}. In this paper, we show that an algorithm combining value iteration with retrograde analysis achieves a time bound of O(r 2^r (r log r + n)), thus improving both time bounds. While the algorithm is simple, the analysis leading to this time bound is involved, using techniques of extremal combinatorics to identify worst case instances for the algorithm.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsArtificial Intelligence in Games · Sports Analytics and Performance · AI-based Problem Solving and Planning
