Faster O(|V|^2|E|W)-Time Energy Algorithms for Optimal Strategy Synthesis in Mean Payoff Games
Carlo Comin, Romeo Rizzi

TL;DR
This paper introduces a faster deterministic algorithm for solving the Value Problem and Optimal Strategy Synthesis in Mean Payoff Games, improving pseudo-polynomial time complexity and exploring the structure of optimal strategies via energy measures.
Contribution
It presents a new $O(|V|^2|E|W)$ pseudo-polynomial time algorithm for MPG strategy synthesis and analyzes the energy-based decomposition of optimal strategies.
Findings
Improved pseudo-polynomial time complexity to $O(|V|^2|E|W)$
Decomposition of optimal strategies into extremal-SEPMs
Recursive procedure for enumerating energy lattice elements
Abstract
This study strengthens the links between Mean Payoff Games (\MPG{s}) and Energy Games (EG{s}). Firstly, we offer a faster pseudo-polynomial time and space deterministic algorithm for solving the Value Problem and Optimal Strategy Synthesis in \MPG{s}. This improves the best previously known estimates on the pseudo-polynomial time complexity to: \[ O(|E|\log |V|) + \Theta\Big(\sum_{v\in V}\texttt{deg}_{\Gamma}(v)\cdot\ell_{\Gamma}(v)\Big) = O(|V|^2|E|W), \] where counts the number of times that a certain energy-lifting operator is applied to any , along a certain sequence of Value-Iterations on reweighted \EG{s}; and is the degree of . This improves significantly over a previously known pseudo-polynomial time estimate, i.e. $\Theta\big(|V|^2|E|W + \sum_{v\in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsOptimization and Search Problems · Reinforcement Learning in Robotics · Advanced Bandit Algorithms Research
