COMPASS: COmpact Multi-channel Prior-map And Scene Signature for Floor-Plan-Based Visual Localization

Muhammad Shaheer; Miguel Fernandez-Cortizas; Asier Bikandi-Noya; Holger Voos; Jose Luis Sanchez-Lopez

arXiv:2604.25388·cs.CV·April 29, 2026

COMPASS: COmpact Multi-channel Prior-map And Scene Signature for Floor-Plan-Based Visual Localization

Muhammad Shaheer, Miguel Fernandez-Cortizas, Asier Bikandi-Noya, Holger Voos, Jose Luis Sanchez-Lopez

PDF

TL;DR

COMPASS is a novel localization algorithm that combines geometric and semantic information from floor plans and fisheye images to accurately estimate a robot's pose.

Contribution

The paper introduces a multi-channel radial descriptor for cross-modal matching of floor plans and fisheye images, incorporating semantic elements like windows and walls.

Findings

01

The descriptor effectively encodes geometric and semantic environment features.

02

Cross-modal matching between floor plans and fisheye images is feasible.

03

Initial experiments validate the structural matching approach.

Abstract

Architectural floor plans are widely available priors which contain not only geometry but also the semantic information of the environment, yet existing localization methods largely ignore this semantic information. To address this, we present COMPASS, an algorithm that exploits both geometric and semantic priors from floor plans to estimate the pose of a robot equipped with dual fisheye cameras. Inspired by scan context descriptor from LiDAR-based place recognition, we design a multi-channel radial descriptor that encodes the geometric layout surrounding a position. From the floor plan, rays are cast in 360 azimuth bins and the results are encoded into five channels: normalized range, structural hit type (wall, window, or opening), range gradient, inverse range, and local range variance. From the image side, the same descriptor structure is populated by detecting structural elements in…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.