Monte Carlo Neural Fictitious Self-Play: Approach to Approximate Nash equilibrium of Imperfect-Information Games
Li Zhang, Wei Wang, Shijian Li, Gang Pan

TL;DR
This paper introduces Monte Carlo Neural Fictitious Self Play (MC-NFSP), an advanced algorithm combining Monte Carlo tree search with NFSP to better approximate Nash equilibrium in large-scale imperfect-information games, outperforming previous methods.
Contribution
It proposes MC-NFSP, integrating Monte Carlo tree search with NFSP, and develops ANFSP with asynchronous parallel architecture to enhance convergence and training stability.
Findings
MC-NFSP converges to approximate Nash equilibrium in large-scale games.
MC-NFSP outperforms NFSP in deep search scenarios.
Parallel ANFSP accelerates training and stabilizes learning.
Abstract
Researchers on artificial intelligence have achieved human-level intelligence in large-scale perfect-information games, but it is still a challenge to achieve (nearly) optimal results (in other words, an approximate Nash Equilibrium) in large-scale imperfect-information games (i.e. war games, football coach or business strategies). Neural Fictitious Self Play (NFSP) is an effective algorithm for learning approximate Nash equilibrium of imperfect-information games from self-play without prior domain knowledge. However, it relies on Deep Q-Network, which is off-line and is hard to converge in online games with changing opponent strategy, so it can't approach approximate Nash equilibrium in games with large search scale and deep search depth. In this paper, we propose Monte Carlo Neural Fictitious Self Play (MC-NFSP), an algorithm combines Monte Carlo tree search with NFSP, which greatly…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsArtificial Intelligence in Games · Sports Analytics and Performance · Reinforcement Learning in Robotics
