FloNa: Floor Plan Guided Embodied Visual Navigation

Jiaxin Li; Weiqi Huang; Zan Wang; Wei Liang; Huijun Di; Feng Liu

arXiv:2412.18335·cs.RO·March 10, 2025

FloNa: Floor Plan Guided Embodied Visual Navigation

Jiaxin Li, Weiqi Huang, Zan Wang, Wei Liang, Huijun Di, Feng Liu

PDF

Open Access 1 Video

TL;DR

FloNa introduces a new embodied visual navigation task that leverages floor plans to improve navigation efficiency and accuracy, addressing challenges of spatial inconsistency and modality alignment with a diffusion policy framework.

Contribution

This work pioneers the integration of floor plan prior knowledge into embodied visual navigation and proposes FloDiff, a diffusion-based policy with localization for better scene alignment.

Findings

01

FloNa significantly improves navigation performance in unfamiliar scenes.

02

The framework effectively handles spatial and modality discrepancies.

03

Extensive experiments validate the approach's efficiency and robustness.

Abstract

Humans naturally rely on floor plans to navigate in unfamiliar environments, as they are readily available, reliable, and provide rich geometrical guidance. However, existing visual navigation settings overlook this valuable prior knowledge, leading to limited efficiency and accuracy. To eliminate this gap, we introduce a novel navigation task: Floor Plan Visual Navigation (FloNa), the first attempt to incorporate floor plan into embodied visual navigation. While the floor plan offers significant advantages, two key challenges emerge: (1) handling the spatial inconsistency between the floor plan and the actual scene layout for collision-free navigation, and (2) aligning observed images with the floor plan sketch despite their distinct modalities. To address these challenges, we propose FloDiff, a novel diffusion policy framework incorporating a localization module to facilitate…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

FloNa: Floor Plan Guided Embodied Visual Navigation· underline

Taxonomy

TopicsHuman Motion and Animation · Human Pose and Action Recognition · Video Analysis and Summarization

MethodsDiffusion