MultiLoc: Multi-view Guided Relative Pose Regression for Fast and Robust Visual Re-Localization
Nobel Dang, Bing Li

TL;DR
MultiLoc introduces a multi-view guided relative pose regression model that fuses multiple reference views for fast, accurate, and robust visual re-localization across diverse environments, outperforming existing methods.
Contribution
The paper presents a novel multi-view guided RPR model with a co-visibility retrieval strategy, achieving state-of-the-art results in visual re-localization and relative pose estimation.
Findings
Outperforms SOTA RPR methods on multiple datasets.
Achieves real-time, zero-shot pose estimation.
Demonstrates robust generalization across indoor and outdoor environments.
Abstract
Relative Pose Regression (RPR) generalizes well to unseen environments, but its performance is often limited due to pairwise and local spatial views. To this end, we propose MultiLoc, a novel multi-view guided RPR model trained at scale, equipping relative pose regression with globally consistent spatial and geometric understanding. Specifically, our method jointly fuses multiple reference views and their associated camera poses in a single forward pass, enabling accurate zero-shot pose estimation with real-time efficiency. To reliably supply informative context, we further propose a co-visibility-driven retrieval strategy for geometrically relevant reference view selection. MultiLoc establishes a new benchmark in visual re-localization, consistently outperforming existing state-of-the-art (SOTA) relative pose regression (RPR) methods across diverse datasets, including WaySpots,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
