Auxiliary Tasks Speed Up Learning PointGoal Navigation

Joel Ye; Dhruv Batra; Erik Wijmans; Abhishek Das

arXiv:2007.04561·cs.CV·November 6, 2020·6 cites

Auxiliary Tasks Speed Up Learning PointGoal Navigation

Joel Ye, Dhruv Batra, Erik Wijmans, Abhishek Das

PDF

Open Access 1 Repo

TL;DR

This paper introduces an efficient learning method for PointGoal Navigation that leverages self-supervised auxiliary tasks and attention mechanisms, significantly reducing training time and improving performance over previous approaches.

Contribution

It proposes a novel combination of auxiliary tasks with attention to enhance sample efficiency and speed up PointNav learning, achieving state-of-the-art results with fewer frames.

Findings

01

5.5x faster training to reach previous SOTA performance

02

Improved SPL by 0.16 at 40M frames

03

Auxiliary tasks combined with attention outperform naive methods

Abstract

PointGoal Navigation is an embodied task that requires agents to navigate to a specified point in an unseen environment. Wijmans et al. showed that this task is solvable but their method is computationally prohibitive, requiring 2.5 billion frames and 180 GPU-days. In this work, we develop a method to significantly increase sample and time efficiency in learning PointNav using self-supervised auxiliary tasks (e.g. predicting the action taken between two egocentric observations, predicting the distance between two observations from a trajectory,etc.).We find that naively combining multiple auxiliary tasks improves sample efficiency,but only provides marginal gains beyond a point. To overcome this, we use attention to combine representations learnt from individual auxiliary tasks. Our best agent is 5.5x faster to reach the performance of the previous state-of-the-art, DD-PPO, at 40M…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

joel99/habitat-pointnav-aux
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMultimodal Machine Learning Applications · Domain Adaptation and Few-Shot Learning · Topic Modeling

MethodsDecentralized Distributed Proximal Policy Optimization