How to Train PointGoal Navigation Agents on a (Sample and Compute) Budget

Erik Wijmans; Irfan Essa; Dhruv Batra

arXiv:2012.06117·cs.CV·December 14, 2020·6 cites

TL;DR

This paper investigates training PointGoal navigation agents efficiently within limited sample and compute budgets, identifying key design choices that significantly improve performance across popular benchmarks.

Contribution

It provides a comprehensive analysis of training strategies and hyper-parameters that enhance PointGoal navigation performance under constrained resources.

Findings

01

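Under the sample budget, RGB-D agents improve by 8 SPL on Gibson (14% relative) and 20 SPL on Matterport3D (38% relative); relative gains on Matterport3D reach up to 220%.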

02

Key design choices are the advantage estimation procedure, the visual encoder architecture, and a seemingly minor hyper-parameter change.

03

Extensive experiments totaling over 50,000 GPU-hours support the findings.

Abstract

PointGoal navigation has seen significant recent interest and progress, spurred on by the Habitat platform and associated challenge. In this paper, we study PointGoal navigation under both a sample budget (75 million frames) and a compute budget (1 GPU for 1 day). We conduct an extensive set of experiments, cumulatively totaling over 50,000 GPU-hours, that let us identify and discuss a number of ostensibly minor but significant design choices -- the advantage estimation procedure (a key component in training), visual encoder architecture, and a seemingly minor hyper-parameter change. Overall, these design choices lead to considerable and consistent improvements over the baselines present in Savva et al. Under a sample budget, performance for RGB-D agents improves by 8 SPL on Gibson (14% relative improvement) and 20 SPL on Matterport3D (38% relative improvement). Under a compute budget,…
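The abstract reports results in SPL (Success weighted by Path Length) and as relative improvements. As a minimal sketch, assuming the standard definition of SPL from the embodied navigation literature (per-episode success times shortest-path length divided by the larger of the agent's path length and the shortest-path length, averaged over episodes), the two quantities could be computed as below. The function names and the example episodes are hypothetical and are not taken from the paper; the abstract's "8 SPL" style figures appear to be this value scaled by 100.

```python
from typing import Sequence


def spl(
    successes: Sequence[bool],
    shortest_lengths: Sequence[float],
    path_lengths: Sequence[float],
) -> float:
    """Mean of S_i * l_i / max(p_i, l_i) over episodes (value in [0, 1])."""
    if not (len(successes) == len(shortest_lengths) == len(path_lengths)):
        raise ValueError("all inputs must have the same number of episodes")
    if len(successes) == 0:
        raise ValueError("at least one episode is required")

    total = 0.0
    for success, shortest, taken in zip(successes, shortest_lengths, path_lengths):
        if shortest <= 0:
            # Assumption: every episode has a strictly positive geodesic distance.
            raise ValueError("shortest path length must be positive")
        # Failed episodes contribute 0; successful ones are penalized for detours.
        total += float(success) * shortest / max(taken, shortest)
    return total / len(successes)


def relative_improvement(baseline: float, improved: float) -> float:
    """Fractional gain of `improved` over `baseline` (0.14 means 14%)."""
    if baseline == 0:
        raise ValueError("baseline must be non-zero")
    return (improved - baseline) / baseline


if __name__ == "__main__":
    # Hypothetical episodes: one optimal success, one success with a 2x detour,
    # and one failure.
    score = spl([True, True, False], [10.0, 10.0, 10.0], [10.0, 20.0, 15.0])
    print(f"SPL: {score:.3f}")
    print(f"Relative improvement: {relative_improvement(0.50, 0.60):.1%}")
```

Because SPL multiplies success by a path-efficiency ratio, an agent that always reaches the goal but wanders can score lower than one that succeeds less often along near-optimal paths, which is why absolute SPL gains and relative gains are reported side by side.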

Taxonomy

Topics: Multimodal Machine Learning Applications · Advanced Neural Network Applications · Robotics and Sensor-Based Localization