The MineRL 2019 Competition on Sample Efficient Reinforcement Learning using Human Priors
William H. Guss, Cayden Codel, Katja Hofmann, Brandon Houghton, Noboru, Kuno, Stephanie Milani, Sharada Mohanty, Diego Perez Liebana, Ruslan, Salakhutdinov, Nicholay Topin, Manuela Veloso, Phillip Wang

TL;DR
This paper introduces the MineRL competition aimed at developing sample-efficient reinforcement learning algorithms that leverage human demonstrations to solve complex tasks in Minecraft, addressing the challenge of high sample requirements.
Contribution
It presents a new environment, the Minecraft ObtainDiamond task, and a large-scale human demonstration dataset to facilitate research on sample-efficient RL methods using human priors.
Findings
Development of algorithms that reduce environment samples needed
Creation of the MineRL-v0 dataset with 60 million demonstrations
Establishment of a competitive framework for benchmarking sample-efficient RL
Abstract
Though deep reinforcement learning has led to breakthroughs in many difficult domains, these successes have required an ever-increasing number of samples. As state-of-the-art reinforcement learning (RL) systems require an exponentially increasing number of samples, their development is restricted to a continually shrinking segment of the AI community. Likewise, many of these systems cannot be applied to real-world problems, where environment samples are expensive. Resolution of these limitations requires new, sample-efficient methods. To facilitate research in this direction, we introduce the MineRL Competition on Sample Efficient Reinforcement Learning using Human Priors. The primary goal of the competition is to foster the development of algorithms which can efficiently leverage human demonstrations to drastically reduce the number of samples needed to solve complex, hierarchical,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Robot Manipulation and Learning · Robotic Path Planning Algorithms
