PRET: Planning with Directed Fidelity Trajectory for Vision and Language Navigation

Renjie Lu; Jingke Meng; Wei-Shi Zheng

arXiv:2407.11487 · cs.CV · July 17, 2024

TL;DR

This paper introduces PRET, a novel navigation planning method that aligns instructions with directed fidelity trajectories on a directed graph, achieving high performance with reduced computational cost in vision and language navigation tasks.

Contribution

The paper proposes a new trajectory representation and alignment strategy for navigation planning that lowers computational cost relative to GCN-like topological-map planners while matching or exceeding their performance on R2R and RxR.

Findings

01. Outperforms the previous state of the art, BEVBert, on the RxR dataset

02. Achieves comparable results on the R2R dataset

03. Significantly reduces computational cost

Abstract

Vision and language navigation is a task that requires an agent to navigate according to a natural language instruction. Recent methods predict sub-goals on a constructed topological map at each step to enable long-term action planning. However, they incur high computational cost when supporting such high-level predictions with GCN-like models. In this work, we propose an alternative method that facilitates navigation planning by considering the alignment between instructions and directed fidelity trajectories, where a directed fidelity trajectory is a path from the initial node to a candidate location on a directed graph without detours. This planning strategy leads to an efficient model while achieving strong performance. Specifically, we introduce a directed graph to represent the explored area of the environment, emphasizing directionality. Then, we first define the trajectory…
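
To make the idea of a path "from the initial node to the candidate locations on a directed graph without detours" more concrete, here is a minimal Python sketch. It is not the authors' implementation: it assumes that "without detours" can be approximated by the fewest-hop path, found with breadth-first search, and the graph, node names, and the helper `shortest_directed_path` are all hypothetical. The instruction-trajectory alignment step described in the abstract is not modeled here.

```python
from collections import deque


def shortest_directed_path(graph, start, goal):
    """Return the fewest-hop path from start to goal in a directed graph.

    graph maps each node to a list of its successors. Returns a list of
    nodes, or None if goal is unreachable. Assumption: fewest hops stands
    in for "no detours"; the paper may define this differently.
    """
    parents = {start: None}  # also serves as the visited set
    queue = deque([start])
    while queue:
        node = queue.popleft()
        if node == goal:
            # Walk parent links back to the start, then reverse.
            path = []
            while node is not None:
                path.append(node)
                node = parents[node]
            return path[::-1]
        for nxt in graph.get(node, []):
            if nxt not in parents:
                parents[nxt] = node
                queue.append(nxt)
    return None


# Hypothetical explored area: edges are one-way, so direction matters.
explored = {
    "start": ["a", "b"],
    "a": ["c"],
    "b": ["a"],
    "c": [],
}

# One detour-free trajectory per candidate location.
candidates = ["a", "c"]
trajectories = {c: shortest_directed_path(explored, "start", c) for c in candidates}
```

In this toy graph the route `start -> b -> a` would count as a detour, because `a` is reachable from `start` directly. A planner in the spirit of the abstract would then score each candidate's trajectory against the instruction, which this sketch leaves out.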

Code & Models

Repositories

isee-laboratory/vln-pret (PyTorch, official)

Taxonomy

Topics: Robotics and Automated Systems