Episodic Exploration for Deep Deterministic Policies: An Application to   StarCraft Micromanagement Tasks

Nicolas Usunier; Gabriel Synnaeve; Zeming Lin; Soumith Chintala

arXiv:1609.02993·cs.AI·November 29, 2016·102 cites

Episodic Exploration for Deep Deterministic Policies: An Application to StarCraft Micromanagement Tasks

Nicolas Usunier, Gabriel Synnaeve, Zeming Lin, Soumith Chintala

PDF

Open Access

TL;DR

This paper introduces a reinforcement learning approach with deep neural networks and a novel exploration algorithm to master micromanagement tasks in StarCraft, demonstrating success in complex, large-scale scenarios.

Contribution

It presents a new heuristic reinforcement learning algorithm combining policy exploration and backpropagation, tailored for large state-action spaces in real-time strategy games.

Findings

01

Successfully learned strategies for up to 15 agents.

02

Outperformed Q-learning and REINFORCE in these tasks.

03

Demonstrated the effectiveness of direct policy exploration.

Abstract

We consider scenarios from the real-time strategy game StarCraft as new benchmarks for reinforcement learning algorithms. We propose micromanagement tasks, which present the problem of the short-term, low-level control of army members during a battle. From a reinforcement learning point of view, these scenarios are challenging because the state-action space is very large, and because there is no obvious feature representation for the state-action evaluation function. We describe our approach to tackle the micromanagement scenarios with deep neural network controllers from raw state features given by the game engine. In addition, we present a heuristic reinforcement learning algorithm which combines direct exploration in the policy space and backpropagation. This algorithm allows for the collection of traces for learning using deterministic policies, which appears much more efficient…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsArtificial Intelligence in Games · Reinforcement Learning in Robotics · Sports Analytics and Performance

MethodsREINFORCE · Q-Learning