SceneAligner: 3D-Grounded Floorplan Localization in the Wild

Junhyeong Cho; Ruojin Cai; Hadar Averbuch-Elor

arXiv:2605.22581 · cs.CV · May 22, 2026

TL;DR

This paper introduces SceneAligner, a novel method for localizing within floorplans in large-scale, real-world environments by leveraging 3D scene reconstruction and cross-modal learning.

Contribution

It presents a 3D-grounded approach that aligns reconstructed scene density maps with rasterized floorplans, enabling localization in unconstrained, large-scale settings.

Findings

01. Significant accuracy improvements over prior methods.

02. Effective localization with as few as one input image.

03. Robust performance in large-scale, real-world environments.

Abstract

Many public buildings provide floorplans with a "you are here" indicator to help visitors orient themselves. Floorplan localization seeks to computationally replicate this capability by determining where visual observations were captured within a floorplan. However, existing methods typically assume controlled small-scale environments and precise vectorized floorplans, limiting their ability to operate in large-scale buildings and rasterized floorplans. In this work, we present an approach for performing floorplan localization in the wild by grounding the task in a reconstructed 3D representation of the scene. Given an unconstrained image collection, our method reconstructs a gravity-aligned 3D scene and projects it into a 2D density map that serves as a floorplan proxy. Floorplan localization is then formulated as aligning this proxy with the input floorplan via a 2D similarity…
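
The projection and alignment steps described in the abstract can be illustrated with a small sketch. This is not the authors' implementation: the function names `density_map` and `apply_similarity`, the `cell` size, and the choice of z as the gravity axis are all assumptions made for illustration. The sketch assumes the point cloud is already gravity-aligned, bins the points onto the ground plane to form a density map, and shows how a 2D similarity transform (scale, rotation, translation) would carry a ground-plane position into floorplan coordinates once such a transform has been estimated.

```python
import numpy as np

def density_map(points, cell=0.1, up_axis=2):
    """Bin gravity-aligned 3D points onto the ground plane (hypothetical helper).

    Returns a 2D grid normalized to [0, 1] and the ground-plane origin of the grid.
    """
    points = np.asarray(points, dtype=float)
    ground = np.delete(points, up_axis, axis=1)      # drop the gravity axis
    origin = ground.min(axis=0)                      # grid corner in scene units
    idx = np.floor((ground - origin) / cell).astype(int)
    grid = np.zeros(tuple(idx.max(axis=0) + 1), dtype=float)
    np.add.at(grid, (idx[:, 0], idx[:, 1]), 1.0)     # count points per cell
    return grid / grid.max(), origin

def apply_similarity(xy, scale, theta, t):
    """Map 2D points with x' = scale * R(theta) @ x + t (hypothetical helper)."""
    c, s = np.cos(theta), np.sin(theta)
    rot = np.array([[c, -s], [s, c]])
    return scale * np.asarray(xy, dtype=float) @ rot.T + np.asarray(t, dtype=float)

# Toy example: three points, two of which fall in the same ground cell.
cloud = [[0.0, 0.0, 0.0], [0.05, 0.05, 1.0], [0.25, 0.0, 2.0]]
grid, origin = density_map(cloud, cell=0.1)

# A camera position on the ground plane, mapped with an assumed transform.
camera_on_floorplan = apply_similarity([[1.0, 0.0]], scale=2.0,
                                       theta=np.pi / 2, t=[10.0, 20.0])
```

In this sketch the transform parameters are given by hand; estimating them from the density map and the rasterized floorplan is the alignment problem the paper addresses, and is not shown here.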

Code & Models

Repositories

cornell-vailab/SceneAligner (GitHub)
