PRET: Planning with Directed Fidelity Trajectory for Vision and Language Navigation
Renjie Lu, Jingke Meng, Wei-Shi Zheng

TL;DR
This paper introduces PRET, a novel navigation planning method that aligns instructions with directed fidelity trajectories on a directed graph, achieving high performance with reduced computational cost in vision and language navigation tasks.
Contribution
The paper proposes a new trajectory representation and alignment strategy for navigation planning that improves efficiency and performance over existing methods.
Findings
Outperforms SOTA BEVBert on RxR dataset
Achieves comparable results on R2R dataset
Significantly reduces computational cost
Abstract
Vision and language navigation is a task that requires an agent to navigate according to a natural language instruction. Recent methods predict sub-goals on constructed topology map at each step to enable long-term action planning. However, they suffer from high computational cost when attempting to support such high-level predictions with GCN-like models. In this work, we propose an alternative method that facilitates navigation planning by considering the alignment between instructions and directed fidelity trajectories, which refers to a path from the initial node to the candidate locations on a directed graph without detours. This planning strategy leads to an efficient model while achieving strong performance. Specifically, we introduce a directed graph to illustrate the explored area of the environment, emphasizing directionality. Then, we firstly define the trajectory…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobotics and Automated Systems
