Enhancing Exploratory Capability of Visual Navigation Using Uncertainty of Implicit Scene Representation
Yichen Wang, Qiming Liu, Zhe Liu, and Hesheng Wang

TL;DR
This paper introduces NUE, a novel visual navigation pipeline that leverages uncertainty in implicit scene representations to improve exploration and target search in unknown environments, outperforming existing methods.
Contribution
The paper proposes an end-to-end NUE pipeline that uses NeRF-based uncertainty estimation to enhance exploration and reasoning in visual navigation tasks.
Findings
Enhanced exploratory behaviors demonstrated in experiments.
Improved navigation performance over existing memory-based methods.
Seamless transition from exploration to exploitation phases.
Abstract
In the context of visual navigation in unknown scenes, both "exploration" and "exploitation" are equally crucial. Robots must first establish environmental cognition through exploration and then utilize the cognitive information to accomplish target searches. However, most existing methods for image-goal navigation prioritize target search over the generation of exploratory behavior. To address this, we propose the Navigation with Uncertainty-driven Exploration (NUE) pipeline, which uses an implicit and compact scene representation, NeRF, as a cognitive structure. We estimate the uncertainty of NeRF and augment the exploratory ability by the uncertainty to in turn facilitate the construction of implicit representation. Simultaneously, we extract memory information from NeRF to enhance the robot's reasoning ability for determining the location of the target. Ultimately, we seamlessly…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGaze Tracking and Assistive Technology · Spatial Cognition and Navigation · Visual Attention and Saliency Detection
