Generalized Rapid Action Value Estimation in Memory-Constrained Environments

Alo\"is Rautureau; Tristan Cazenave; \'Eric Piette

arXiv:2602.23318·cs.AI·February 27, 2026

Generalized Rapid Action Value Estimation in Memory-Constrained Environments

Alo\"is Rautureau, Tristan Cazenave, \'Eric Piette

PDF

Open Access

TL;DR

This paper introduces new variants of the GRAVE algorithm that significantly reduce memory usage in Monte-Carlo Tree Search for General Game Playing, while maintaining high performance.

Contribution

The paper proposes GRAVE2, GRAVER, and GRAVER2 algorithms that extend GRAVE with memory-efficient techniques like two-level search and node recycling.

Findings

01

Reduced memory requirements while maintaining performance

02

Matching GRAVE's playing strength with fewer stored nodes

03

Effective in memory-constrained environments

Abstract

Generalized Rapid Action Value Estimation (GRAVE) has been shown to be a strong variant within the Monte-Carlo Tree Search (MCTS) family of algorithms for General Game Playing (GGP). However, its reliance on storing additional win/visit statistics at each node makes its use impractical in memory-constrained environments, thereby limiting its applicability in practice. In this paper, we introduce the GRAVE2, GRAVER and GRAVER2 algorithms, which extend GRAVE through two-level search, node recycling, and a combination of both techniques, respectively. We show that these enhancements enable a drastic reduction in the number of stored nodes while matching the playing strength of GRAVE.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsArtificial Intelligence in Games · Reinforcement Learning in Robotics · Digital Games and Media