Learning Macromanagement in StarCraft from Replays using Deep Learning

Niels Justesen; Sebastian Risi

arXiv:1707.03743·cs.AI·July 13, 2017

Learning Macromanagement in StarCraft from Replays using Deep Learning

Niels Justesen, Sebastian Risi

PDF

TL;DR

This paper demonstrates that deep learning can effectively learn macromanagement decisions in StarCraft directly from game replays, enabling the development of more adaptable and less hand-crafted AI agents.

Contribution

It introduces a novel approach of training neural networks on replays to learn macromanagement, outperforming built-in bots and paving the way for future reinforcement learning improvements.

Findings

01

Neural networks achieved 54.6% top-1 error in predicting build actions.

02

The system outperforms the game's built-in Terran bot.

03

The approach is the first to learn macromanagement directly from replays in StarCraft.

Abstract

The real-time strategy game StarCraft has proven to be a challenging environment for artificial intelligence techniques, and as a result, current state-of-the-art solutions consist of numerous hand-crafted modules. In this paper, we show how macromanagement decisions in StarCraft can be learned directly from game replays using deep learning. Neural networks are trained on 789,571 state-action pairs extracted from 2,005 replays of highly skilled players, achieving top-1 and top-3 error rates of 54.6% and 22.9% in predicting the next build action. By integrating the trained network into UAlbertaBot, an open source StarCraft bot, the system can significantly outperform the game's built-in Terran bot, and play competitively against UAlbertaBot with a fixed rush strategy. To our knowledge, this is the first time macromanagement tasks are learned directly from replays in StarCraft. While the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.