Strategy complexity of finite-horizon Markov decision processes and   simple stochastic games

Krishnendu Chatterjee; Rasmus Ibsen-Jensen

arXiv:1209.3617·cs.GT·September 18, 2012

Strategy complexity of finite-horizon Markov decision processes and simple stochastic games

Krishnendu Chatterjee, Rasmus Ibsen-Jensen

PDF

Open Access

TL;DR

This paper analyzes the strategy complexity of finite-horizon MDPs and SSGs, providing asymptotically optimal bounds on memory requirements and revealing sub-exponential lower bounds on the period of optimal strategies.

Contribution

It establishes tight bounds on the memory size needed for strategies in finite-horizon MDPs and SSGs and investigates the periodic properties of optimal strategies.

Findings

01

Counter-based strategies require at most log log (1/ε) + n+1 memory states.

02

Memory of size Ω(log log (1/ε) + n) is necessary.

03

Optimal strategies have a sub-exponential lower bound on their period.

Abstract

Markov decision processes (MDPs) and simple stochastic games (SSGs) provide a rich mathematical framework to study many important problems related to probabilistic systems. MDPs and SSGs with finite-horizon objectives, where the goal is to maximize the probability to reach a target state in a given finite time, is a classical and well-studied problem. In this work we consider the strategy complexity of finite-horizon MDPs and SSGs. We show that for all $ϵ > 0$ , the natural class of counter-based strategies require at most $lo g lo g (\frac{1}{ϵ}) + n + 1$ memory states, and memory of size $Ω (lo g lo g (\frac{1}{ϵ}) + n)$ is required. Thus our bounds are asymptotically optimal. We then study the periodic property of optimal strategies, and show a sub-exponential lower bound on the period for optimal strategies.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGame Theory and Applications · Bayesian Modeling and Causal Inference · Formal Methods in Verification