Using Graph Convolutional Networks and TD($\lambda$) to play the game of Risk
Jamie Carr

TL;DR
This paper presents D.A.D, a reinforcement learning-based Risk AI using graph convolutional networks and TD($b$), achieving significantly improved performance over existing built-in AIs by minimizing handcrafted features.
Contribution
Introduces a novel Risk-playing AI that combines GCNs and TD(b$) with minimal feature engineering, addressing game randomness and complexity.
Findings
AI wins 35% of games versus top built-in AI
Uses GCNs to automatically extract features from game states
Effective handling of non-determinism in Risk gameplay
Abstract
Risk is 6 player game with significant randomness and a large game-tree complexity which poses a challenge to creating an agent to play the game effectively. Previous AIs focus on creating high-level handcrafted features determine agent decision making. In this project, I create D.A.D, A Risk agent using temporal difference reinforcement learning to train a Deep Neural Network including a Graph Convolutional Network to evaluate player positions. This is used in a game-tree to select optimal moves. This allows minimal handcrafting of knowledge into the AI, assuring input features are as low-level as possible to allow the network to extract useful and sophisticated features itself, even with the network starting from a random initialisation. I also tackle the issue of non-determinism in Risk by introducing a new method of interpreting attack moves necessary for the search. The result is…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsArtificial Intelligence in Games · Reinforcement Learning in Robotics · Sports Analytics and Performance
