Generalized Rapid Action Value Estimation in Memory-Constrained Environments
Alo\"is Rautureau, Tristan Cazenave, \'Eric Piette

TL;DR
This paper introduces new variants of the GRAVE algorithm that significantly reduce memory usage in Monte-Carlo Tree Search for General Game Playing, while maintaining high performance.
Contribution
The paper proposes GRAVE2, GRAVER, and GRAVER2 algorithms that extend GRAVE with memory-efficient techniques like two-level search and node recycling.
Findings
Reduced memory requirements while maintaining performance
Matching GRAVE's playing strength with fewer stored nodes
Effective in memory-constrained environments
Abstract
Generalized Rapid Action Value Estimation (GRAVE) has been shown to be a strong variant within the Monte-Carlo Tree Search (MCTS) family of algorithms for General Game Playing (GGP). However, its reliance on storing additional win/visit statistics at each node makes its use impractical in memory-constrained environments, thereby limiting its applicability in practice. In this paper, we introduce the GRAVE2, GRAVER and GRAVER2 algorithms, which extend GRAVE through two-level search, node recycling, and a combination of both techniques, respectively. We show that these enhancements enable a drastic reduction in the number of stored nodes while matching the playing strength of GRAVE.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsArtificial Intelligence in Games · Reinforcement Learning in Robotics · Digital Games and Media
