Critic Guided Segmentation of Rewarding Objects in First-Person Views

Andrew Melnik; Augustin Harter; Christian Limberg; Krishan Rana; Niko Suenderhauf; Helge Ritter

arXiv:2107.09540·cs.CV·July 21, 2021

TL;DR

This paper introduces a critic-guided segmentation method that learns to mask rewarding objects in first-person images using only sparse reward signals from an imitation learning dataset. The approach was part of the 1st place winning solution in the NeurIPS 2020 MineRL Competition.

Contribution

It presents an approach that trains an Hourglass segmentation network using only feedback from a critic model, without explicit object annotations, to mask rewarding objects in complex scenes.

Findings

01

Was part of the 1st place winning solution in the NeurIPS 2020 MineRL Competition.

02

Learned to mask rewarding objects in a complex interactive 3D environment.

03

Demonstrated effectiveness with sparse reward signals.

Abstract

This work discusses a learning approach to mask rewarding objects in images using sparse reward signals from an imitation learning dataset. For that, we train an Hourglass network using only feedback from a critic model. The Hourglass network learns to produce a mask to decrease the critic's score of a high score image and increase the critic's score of a low score image by swapping the masked areas between these two images. We trained the model on an imitation learning dataset from the NeurIPS 2020 MineRL Competition Track, where our model learned to mask rewarding objects in a complex interactive 3D environment with a sparse reward signal. This approach was part of the 1st place winning solution in this competition. Video demonstration and code: https://rebrand.ly/critic-guided-segmentation
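
The mask-swapping idea in the abstract can be illustrated with a small sketch. This is not the authors' implementation: the soft-blend formula, the objective `critic(high_swapped) - critic(low_swapped)`, the function names, and the brightness-based `toy_critic` are all assumptions chosen to keep the example dependency-free. In the paper the mask comes from an Hourglass network and the critic is a learned model.

```python
from typing import Callable, List, Tuple

# Grayscale image as rows of pixel values (illustrative representation only).
Image = List[List[float]]


def swap_masked_areas(high: Image, low: Image, mask: Image) -> Tuple[Image, Image]:
    """Exchange the masked pixels of two equally sized images.

    Mask values lie in [0, 1]; 1 means "take this pixel from the other image".
    """
    high_swapped = [
        [h * (1.0 - m) + l * m for h, l, m in zip(h_row, l_row, m_row)]
        for h_row, l_row, m_row in zip(high, low, mask)
    ]
    low_swapped = [
        [l * (1.0 - m) + h * m for h, l, m in zip(h_row, l_row, m_row)]
        for h_row, l_row, m_row in zip(high, low, mask)
    ]
    return high_swapped, low_swapped


def swap_objective(
    critic: Callable[[Image], float], high: Image, low: Image, mask: Image
) -> float:
    """Hypothetical objective for the mask generator (lower is better).

    It falls when the swap lowers the critic's score of the high-score image
    and raises the critic's score of the low-score image.
    """
    high_swapped, low_swapped = swap_masked_areas(high, low, mask)
    return critic(high_swapped) - critic(low_swapped)


def toy_critic(image: Image) -> float:
    """Stand-in critic: mean brightness, pretending bright pixels are rewarding."""
    pixels = [p for row in image for p in row]
    return sum(pixels) / len(pixels)


high = [[0.0, 1.0], [0.0, 0.0]]        # contains one "rewarding" bright pixel
low = [[0.0, 0.0], [0.0, 0.0]]         # contains nothing rewarding
good_mask = [[0.0, 1.0], [0.0, 0.0]]   # covers exactly the bright pixel
empty_mask = [[0.0, 0.0], [0.0, 0.0]]  # swaps nothing

good = swap_objective(toy_critic, high, low, good_mask)
empty = swap_objective(toy_critic, high, low, empty_mask)
print(good, empty)
```

In this toy setup the mask that covers the bright pixel yields a lower objective than the empty mask, which is the direction of pressure the abstract describes. A mask covering the whole image would score equally well here, so a practical objective would presumably need an additional term to keep masks tight; the abstract does not say how the authors handle this.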

Code & Models

Repositories

ndrwmlnk/critic-guided-segmentation-of-rewarding-objects-in-first-person-views
pytorch
