Combining Learning from Human Feedback and Knowledge Engineering to   Solve Hierarchical Tasks in Minecraft

Vinicius G. Goecks; Nicholas Waytowich; David Watkins-Valls; Bharat; Prakash

arXiv:2112.03482·cs.LG·May 13, 2022·1 cites

Combining Learning from Human Feedback and Knowledge Engineering to Solve Hierarchical Tasks in Minecraft

Vinicius G. Goecks, Nicholas Waytowich, David Watkins-Valls, Bharat, Prakash

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper presents a hybrid approach combining imitation learning, human feedback, and knowledge engineering to solve complex hierarchical tasks in Minecraft, winning the 2021 NeurIPS MineRL BASALT Challenge.

Contribution

It introduces a novel hybrid method that integrates human demonstration data, feedback, and engineered modules for hierarchical task solving in a complex environment.

Findings

01

Achieved first place in the NeurIPS MineRL BASALT Challenge

02

Outperformed end-to-end machine learning approaches

03

Provided a human-like, interpretable agent solution

Abstract

Real-world tasks of interest are generally poorly defined by human-readable descriptions and have no pre-defined reward signals unless it is defined by a human designer. Conversely, data-driven algorithms are often designed to solve a specific, narrowly defined, task with performance metrics that drives the agent's learning. In this work, we present the solution that won first place and was awarded the most human-like agent in the 2021 NeurIPS Competition MineRL BASALT Challenge: Learning from Human Feedback in Minecraft, which challenged participants to use human data to solve four tasks defined only by a natural language description and no reward function. Our approach uses the available human demonstration data to train an imitation learning policy for navigation and additional human feedback to train an image classifier. These modules, combined with an estimated odometry map, become…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

viniciusguigo/kairos_minerl_basalt
pytorchOfficial

Videos

This Team won the Minecraft RL BASALT Challenge! (Paper Explanation & Interview with the authors)· youtube

Taxonomy

TopicsExplainable Artificial Intelligence (XAI) · Anomaly Detection Techniques and Applications · Reinforcement Learning in Robotics