R-SCoRe: Revisiting Scene Coordinate Regression for Robust Large-Scale   Visual Localization

Xudong Jiang; Fangjinhua Wang; Silvano Galliani; Christoph Vogel; Marc; Pollefeys

arXiv:2501.01421·cs.CV·April 14, 2025

R-SCoRe: Revisiting Scene Coordinate Regression for Robust Large-Scale Visual Localization

Xudong Jiang, Fangjinhua Wang, Silvano Galliani, Christoph Vogel, Marc, Pollefeys

PDF

Open Access 1 Repo

TL;DR

This paper enhances scene coordinate regression for visual localization by introducing novel encoding, data augmentation, and architecture improvements, achieving state-of-the-art accuracy on large-scale, challenging datasets with smaller map sizes.

Contribution

It presents a covisibility graph-based encoding, depth-adjusted loss, and architecture revisits that significantly improve SCR robustness and accuracy without large maps or 3D supervision.

Findings

01

Achieves 10× higher accuracy on Aachen Day-Night compared to previous SCR methods.

02

Requires at least 5× smaller map sizes while maintaining superior accuracy.

03

State-of-the-art performance on large-scale datasets without ensemble or 3D supervision.

Abstract

Learning-based visual localization methods that use scene coordinate regression (SCR) offer the advantage of smaller map sizes. However, on datasets with complex illumination changes or image-level ambiguities, it remains a less robust alternative to feature matching methods. This work aims to close the gap. We introduce a covisibility graph-based global encoding learning and data augmentation strategy, along with a depth-adjusted reprojection loss to facilitate implicit triangulation. Additionally, we revisit the network architecture and local feature extraction module. Our method achieves state-of-the-art on challenging large-scale datasets without relying on network ensembles or 3D supervision. On Aachen Day-Night, we are 10 $\times$ more accurate than previous SCR methods with similar map sizes and require at least 5 $\times$ smaller map sizes than any other SCR method while still…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

cvg/scrstudio
jaxOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Image and Video Retrieval Techniques · Advanced Vision and Imaging · Image Retrieval and Classification Techniques