Analysis of Hannan Consistent Selection for Monte Carlo Tree Search in Simultaneous Move Games
Vojt\v{e}ch Kova\v{r}\'ik, Viliam Lis\'y

TL;DR
This paper investigates the convergence properties of Monte Carlo Tree Search (MCTS) with Hannan consistent selection functions in simultaneous move games, providing theoretical guarantees and empirical evidence for convergence to Nash equilibria.
Contribution
It establishes that with minor modifications, Hannan consistent algorithms in MCTS reliably converge to approximate Nash equilibria in simultaneous move games.
Findings
Hannan consistency alone does not guarantee convergence without modifications.
Minor technical modifications enable convergence to Nash equilibria.
Empirical results support theoretical claims about convergence speed and quality.
Abstract
Monte Carlo Tree Search (MCTS) has recently been successfully used to create strategies for playing imperfect-information games. Despite its popularity, there are no theoretic results that guarantee its convergence to a well-defined solution, such as Nash equilibrium, in these games. We partially fill this gap by analysing MCTS in the class of zero-sum extensive-form games with simultaneous moves but otherwise perfect information. The lack of information about the opponent's concurrent moves already causes that optimal strategies may require randomization. We present theoretic as well as empirical investigation of the speed and quality of convergence of these algorithms to the Nash equilibria. Primarily, we show that after minor technical modifications, MCTS based on any (approximately) Hannan consistent selection function always converges to an (approximate) subgame perfect Nash…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
