Playing Hex and Counter Wargames using Reinforcement Learning and   Recurrent Neural Networks

Guilherme Palma; Pedro A. Santos; Jo\~ao Dias

arXiv:2502.13918·cs.LG·February 20, 2025

Playing Hex and Counter Wargames using Reinforcement Learning and Recurrent Neural Networks

Guilherme Palma, Pedro A. Santos, Jo\~ao Dias

PDF

Open Access 1 Repo

TL;DR

This paper presents a novel reinforcement learning system combining Recurrent Neural Networks and AlphaZero to master complex Hex and Counter Wargames, addressing their strategic intricacies and scalability challenges.

Contribution

It introduces a new neural network architecture and state-action representations tailored for complex wargame environments, enabling effective learning with minimal training.

Findings

01

Promising results in typical scenarios

02

Generalization across different terrains and tactics

03

Potential scalability to larger maps

Abstract

Hex and Counter Wargames are adversarial two-player simulations of real military conflicts requiring complex strategic decision-making. Unlike classical board games, these games feature intricate terrain/unit interactions, unit stacking, large maps of varying sizes, and simultaneous move and combat decisions involving hundreds of units. This paper introduces a novel system designed to address the strategic complexity of Hex and Counter Wargames by integrating cutting-edge advancements in Recurrent Neural Networks with AlphaZero, a reliable modern Reinforcement Learning algorithm. The system utilizes a new Neural Network architecture developed from existing research, incorporating innovative state and action representations tailored to these specific game environments. With minimal training, our solution has shown promising results in typical scenarios, demonstrating the ability to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

guilherme439/nuzero
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTerrorism, Counterterrorism, and Political Violence · Network Security and Intrusion Detection · Crime, Illicit Activities, and Governance

MethodsAlphaZero