Lookahead Exploration with Neural Radiance Representation for Continuous Vision-Language Navigation

Zihan Wang; Xiangyang Li; Jiahao Yang; Yeqi Liu; Junjie Hu; Ming Jiang; Shuqiang Jiang

arXiv:2404.01943 · cs.CV · April 3, 2024 · 1 citation

TL;DR

This paper introduces a novel hierarchical neural radiance model for vision-language navigation that improves future environment prediction, enabling more effective lookahead exploration and path planning in 3D environments.

Contribution

It proposes a pre-trained hierarchical neural radiance model to produce robust semantic features for future environments, enhancing lookahead navigation planning.

Findings

01 Outperforms existing methods on VLN-CE datasets

02 Efficient parallel evaluation of future paths

03 Robust semantic feature representation for environments

Abstract

Vision-and-language navigation (VLN) enables the agent to navigate to a remote location following the natural language instruction in 3D environments. At each navigation step, the agent selects from possible candidate locations and then makes the move. For better navigation planning, the lookahead exploration strategy aims to effectively evaluate the agent's next action by accurately anticipating the future environment of candidate locations. To this end, some existing works predict RGB images for future environments, while this strategy suffers from image distortion and high computational cost. To address these issues, we propose the pre-trained hierarchical neural radiance representation model (HNR) to produce multi-level semantic features for future environments, which are more robust and efficient than pixel-wise RGB reconstruction. Furthermore, with the predicted future…
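
To make the lookahead idea in the abstract concrete, here is a minimal sketch of scoring candidate locations by their predicted future features. It is an illustration under stated assumptions, not the authors' implementation: `predict_future_feature` is a hypothetical stand-in for a model such as HNR, the feature vectors are made up, and cosine similarity against an instruction embedding is a simplification chosen only for the example.

```python
from math import sqrt


def cosine(a, b):
    """Cosine similarity between two equal-length vectors (0.0 if either is zero)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = sqrt(sum(x * x for x in a))
    norm_b = sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def predict_future_feature(candidate):
    # Hypothetical placeholder: a real system would run a learned model here
    # to predict semantic features of the not-yet-visited location.
    # This sketch simply returns a precomputed vector.
    return candidate["feature"]


def select_next_location(instruction_embedding, candidates,
                         predict=predict_future_feature):
    """Score every candidate by its predicted future feature and pick the best.

    Each candidate is scored independently, so the loop could be batched
    and evaluated in parallel.
    """
    scores = {
        c["name"]: cosine(instruction_embedding, predict(c))
        for c in candidates
    }
    best = max(scores, key=scores.get)
    return best, scores


# Toy data (assumed for illustration only).
instruction = [1.0, 0.0]
candidates = [
    {"name": "hallway", "feature": [0.9, 0.1]},
    {"name": "kitchen", "feature": [0.1, 0.9]},
]
best, scores = select_next_location(instruction, candidates)
```

In this toy setup the agent picks the candidate whose predicted feature aligns best with the instruction; predicting compact features rather than full RGB images is what the abstract argues makes such evaluation cheaper and more robust.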

Code & Models

Repositories

mrzihan/hnr-vln (PyTorch, official)

Taxonomy

Topics: Robotics and Automated Systems