The MineRL 2019 Competition on Sample Efficient Reinforcement Learning   using Human Priors

William H. Guss; Cayden Codel; Katja Hofmann; Brandon Houghton; Noboru; Kuno; Stephanie Milani; Sharada Mohanty; Diego Perez Liebana; Ruslan; Salakhutdinov; Nicholay Topin; Manuela Veloso; Phillip Wang

arXiv:1904.10079·cs.LG·January 20, 2021·47 cites

The MineRL 2019 Competition on Sample Efficient Reinforcement Learning using Human Priors

William H. Guss, Cayden Codel, Katja Hofmann, Brandon Houghton, Noboru, Kuno, Stephanie Milani, Sharada Mohanty, Diego Perez Liebana, Ruslan, Salakhutdinov, Nicholay Topin, Manuela Veloso, Phillip Wang

PDF

Open Access 1 Repo

TL;DR

This paper introduces the MineRL competition aimed at developing sample-efficient reinforcement learning algorithms that leverage human demonstrations to solve complex tasks in Minecraft, addressing the challenge of high sample requirements.

Contribution

It presents a new environment, the Minecraft ObtainDiamond task, and a large-scale human demonstration dataset to facilitate research on sample-efficient RL methods using human priors.

Findings

01

Development of algorithms that reduce environment samples needed

02

Creation of the MineRL-v0 dataset with 60 million demonstrations

03

Establishment of a competitive framework for benchmarking sample-efficient RL

Abstract

Though deep reinforcement learning has led to breakthroughs in many difficult domains, these successes have required an ever-increasing number of samples. As state-of-the-art reinforcement learning (RL) systems require an exponentially increasing number of samples, their development is restricted to a continually shrinking segment of the AI community. Likewise, many of these systems cannot be applied to real-world problems, where environment samples are expensive. Resolution of these limitations requires new, sample-efficient methods. To facilitate research in this direction, we introduce the MineRL Competition on Sample Efficient Reinforcement Learning using Human Priors. The primary goal of the competition is to foster the development of algorithms which can efficiently leverage human demonstrations to drastically reduce the number of samples needed to solve complex, hierarchical,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

shadowyzy/NIPS2019-MineRL-Competition-solution
pytorch

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Robot Manipulation and Learning · Robotic Path Planning Algorithms