Perspective from a Broader Context: Can Room Style Knowledge Help Visual Floorplan Localization?
Bolei Chen, Shengsheng Yan, Yongzheng Cui, Jiaxu Kang, Ping Zhong, Jianxin Wang

TL;DR
This paper introduces a novel approach for visual floorplan localization that leverages broader scene context and room style knowledge through unsupervised learning, significantly improving robustness and accuracy over existing methods.
Contribution
It proposes an unsupervised pre-training method using a room discriminator to incorporate scene context into FLoc algorithms, enhancing localization performance.
Findings
Outperforms state-of-the-art methods on benchmark datasets.
Achieves significant improvements in robustness and accuracy.
Effectively utilizes scene context and room style knowledge.
Abstract
Since a building's floorplan remains consistent over time and is inherently robust to changes in visual appearance, visual Floorplan Localization (FLoc) has received increasing attention from researchers. However, as a compact and minimalist representation of the building's layout, floorplans contain many repetitive structures (e.g., hallways and corners), thus easily result in ambiguous localization. Existing methods either pin their hopes on matching 2D structural cues in floorplans or rely on 3D geometry-constrained visual pre-trainings, ignoring the richer contextual information provided by visual images. In this paper, we suggest using broader visual scene context to empower FLoc algorithms with scene layout priors to eliminate localization uncertainty. In particular, we propose an unsupervised learning technique with clustering constraints to pre-train a room discriminator on…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsAdvanced Vision and Imaging · Advanced Neural Network Applications · Advanced Image and Video Retrieval Techniques
