MineRL: A Large-Scale Dataset of Minecraft Demonstrations
William H. Guss, Brandon Houghton, Nicholay Topin, Phillip Wang,, Cayden Codel, Manuela Veloso, Ruslan Salakhutdinov

TL;DR
The paper introduces MineRL, a large-scale, high-quality dataset of human demonstrations in Minecraft, designed to advance reinforcement learning research by providing extensive, structured data for developing sample-efficient algorithms.
Contribution
It presents a novel, scalable data collection scheme for creating a comprehensive Minecraft demonstration dataset, enabling new research directions in reinforcement learning with human data.
Findings
MineRL contains over 60 million state-action pairs.
The dataset demonstrates diversity and hierarchical structure.
Minecraft presents significant challenges for reinforcement learning.
Abstract
The sample inefficiency of standard deep reinforcement learning methods precludes their application to many real-world problems. Methods which leverage human demonstrations require fewer samples but have been researched less. As demonstrated in the computer vision and natural language processing communities, large-scale datasets have the capacity to facilitate research by serving as an experimental and benchmarking platform for new methods. However, existing datasets compatible with reinforcement learning simulators do not have sufficient scale, structure, and quality to enable the further development and evaluation of methods focused on using human examples. Therefore, we introduce a comprehensive, large-scale, simulator-paired dataset of human demonstrations: MineRL. The dataset consists of over 60 million automatically annotated state-action pairs across a variety of related tasks in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Adversarial Robustness in Machine Learning · Robot Manipulation and Learning
