Faster O(|V|^2|E|W)-Time Energy Algorithms for Optimal Strategy   Synthesis in Mean Payoff Games

Carlo Comin; Romeo Rizzi

arXiv:1609.01517·cs.DS·September 7, 2016

Faster O(|V|^2|E|W)-Time Energy Algorithms for Optimal Strategy Synthesis in Mean Payoff Games

Carlo Comin, Romeo Rizzi

PDF

Open Access

TL;DR

This paper introduces a faster deterministic algorithm for solving the Value Problem and Optimal Strategy Synthesis in Mean Payoff Games, improving pseudo-polynomial time complexity and exploring the structure of optimal strategies via energy measures.

Contribution

It presents a new $O(|V|^2|E|W)$ pseudo-polynomial time algorithm for MPG strategy synthesis and analyzes the energy-based decomposition of optimal strategies.

Findings

01

Improved pseudo-polynomial time complexity to $O(|V|^2|E|W)$

02

Decomposition of optimal strategies into extremal-SEPMs

03

Recursive procedure for enumerating energy lattice elements

Abstract

This study strengthens the links between Mean Payoff Games (\MPG{s}) and Energy Games (EG{s}). Firstly, we offer a faster $O (∣ V ∣^{2} ∣ E ∣ W)$ pseudo-polynomial time and $Θ (∣ V ∣ + ∣ E ∣)$ space deterministic algorithm for solving the Value Problem and Optimal Strategy Synthesis in \MPG{s}. This improves the best previously known estimates on the pseudo-polynomial time complexity to: \[ O(|E|\log |V|) + \Theta\Big(\sum_{v\in V}\texttt{deg}_{\Gamma}(v)\cdot\ell_{\Gamma}(v)\Big) = O(|V|^2|E|W), \] where $ℓ_{Γ} (v)$ counts the number of times that a certain energy-lifting operator $δ (\cdot, v)$ is applied to any $v \in V$ , along a certain sequence of Value-Iterations on reweighted \EG{s}; and $deg_{Γ} (v)$ is the degree of $v$ . This improves significantly over a previously known pseudo-polynomial time estimate, i.e. $\Theta\big(|V|^2|E|W + \sum_{v\in…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsOptimization and Search Problems · Reinforcement Learning in Robotics · Advanced Bandit Algorithms Research