Is Mapping Necessary for Realistic PointGoal Navigation?
Ruslan Partsey, Erik Wijmans, Naoki Yokoyama, Oles Dobosevych, Dhruv, Batra, Oleksandr Maksymets

TL;DR
This paper investigates whether explicit mapping is necessary for realistic PointGoal navigation, demonstrating that robust visual odometry can significantly improve success rates without relying on explicit maps in complex environments.
Contribution
The authors develop data-augmentation techniques for visual odometry, achieving a new state-of-the-art success rate of 94% on the Habitat Realistic PointNav Challenge, challenging the necessity of explicit mapping.
Findings
Achieved 94% success rate on Habitat Realistic PointNav Challenge
Robust visual odometry can nearly match GPS+Compass performance in realistic settings
Explicit mapping may not be essential for effective PointNav in complex environments
Abstract
Can an autonomous agent navigate in a new environment without building an explicit map? For the task of PointGoal navigation ('Go to , ') under idealized settings (no RGB-D and actuation noise, perfect GPS+Compass), the answer is a clear 'yes' - map-less neural models composed of task-agnostic components (CNNs and RNNs) trained with large-scale reinforcement learning achieve 100% Success on a standard dataset (Gibson). However, for PointNav in a realistic setting (RGB-D and actuation noise, no GPS+Compass), this is an open question; one we tackle in this paper. The strongest published result for this task is 71.7% Success. First, we identify the main (perhaps, only) cause of the drop in performance: the absence of GPS+Compass. An agent with perfect GPS+Compass faced with RGB-D sensing and actuation noise achieves 99.8% Success (Gibson-v2 val). This suggests that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobotics and Sensor-Based Localization · Multimodal Machine Learning Applications · Domain Adaptation and Few-Shot Learning
