Playing Hex and Counter Wargames using Reinforcement Learning and Recurrent Neural Networks
Guilherme Palma, Pedro A. Santos, Jo\~ao Dias

TL;DR
This paper presents a novel reinforcement learning system combining Recurrent Neural Networks and AlphaZero to master complex Hex and Counter Wargames, addressing their strategic intricacies and scalability challenges.
Contribution
It introduces a new neural network architecture and state-action representations tailored for complex wargame environments, enabling effective learning with minimal training.
Findings
Promising results in typical scenarios
Generalization across different terrains and tactics
Potential scalability to larger maps
Abstract
Hex and Counter Wargames are adversarial two-player simulations of real military conflicts requiring complex strategic decision-making. Unlike classical board games, these games feature intricate terrain/unit interactions, unit stacking, large maps of varying sizes, and simultaneous move and combat decisions involving hundreds of units. This paper introduces a novel system designed to address the strategic complexity of Hex and Counter Wargames by integrating cutting-edge advancements in Recurrent Neural Networks with AlphaZero, a reliable modern Reinforcement Learning algorithm. The system utilizes a new Neural Network architecture developed from existing research, incorporating innovative state and action representations tailored to these specific game environments. With minimal training, our solution has shown promising results in typical scenarios, demonstrating the ability to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTerrorism, Counterterrorism, and Political Violence · Network Security and Intrusion Detection · Crime, Illicit Activities, and Governance
MethodsAlphaZero
