Non-oblivious Strategy Improvement

John Fearnley

arXiv:1003.2976·cs.GT·May 18, 2015

Non-oblivious Strategy Improvement

John Fearnley

PDF

TL;DR

This paper introduces non-oblivious strategy improvement algorithms for mean-payoff and parity games that leverage structural properties of the game to enhance performance, especially on challenging instances.

Contribution

It presents a novel class of algorithms that remember game structures, improving upon oblivious methods by accelerating convergence on difficult cases.

Findings

01

Non-oblivious algorithms outperform oblivious ones on hard examples.

02

Structural awareness leads to faster strategy improvement.

03

Previous algorithms fail due to ignoring game structures.

Abstract

We study strategy improvement algorithms for mean-payoff and parity games. We describe a structural property of these games, and we show that these structures can affect the behaviour of strategy improvement. We show how awareness of these structures can be used to accelerate strategy improvement algorithms. We call our algorithms non-oblivious because they remember properties of the game that they have discovered in previous iterations. We show that non-oblivious strategy improvement algorithms perform well on examples that are known to be hard for oblivious strategy improvement. Hence, we argue that previous strategy improvement algorithms fail because they ignore the structural properties of the game that they are solving.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.