FloorPlan-VLN: A New Paradigm for Floor Plan Guided Vision-Language Navigation
Kehan Chen, Yan Huang, Dong An, Jiawei He, Yifei Su, Jing Liu, Nianfeng Liu, Liang Wang

TL;DR
This paper introduces FloorPlan-VLN, a new paradigm for vision-language navigation that leverages semantic floor plans as global spatial priors, significantly improving navigation success rates in complex indoor environments.
Contribution
It presents a new dataset and a novel method, FP-Nav, that effectively utilize floor plans for spatial reasoning in navigation tasks, outperforming existing baselines.
Findings
Over 60% improvement in navigation success rate
Robust performance under actuation drift and floor plan distortions
Validated effectiveness through real-world deployment
Abstract
Existing Vision-Language Navigation (VLN) task requires agents to follow verbose instructions, ignoring some potentially useful global spatial priors, limiting their capability to reason about spatial structures. Although human-readable spatial schematics (e.g., floor plans) are ubiquitous in real-world buildings, current agents lack the cognitive ability to comprehend and utilize them. To bridge this gap, we introduce \textbf{FloorPlan-VLN}, a new paradigm that leverages structured semantic floor plans as global spatial priors to enable navigation with only concise instructions. We first construct the FloorPlan-VLN dataset, which comprises over 10k episodes across 72 scenes. It pairs more than 100 semantically annotated floor plans with Matterport3D-based navigation trajectories and concise instructions that omit step-by-step guidance. Then, we propose a simple yet effective method…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMultimodal Machine Learning Applications · Robotics and Sensor-Based Localization · Robotic Path Planning Algorithms
