GSVisLoc: Generalizable Visual Localization for Gaussian Splatting Scene Representations

Fadi Khatib; Dror Moran; Guy Trostianetsky; Yoni Kasten; Meirav Galun; Ronen Basri

arXiv:2508.18242·cs.CV·August 26, 2025

GSVisLoc: Generalizable Visual Localization for Gaussian Splatting Scene Representations

Fadi Khatib, Dror Moran, Guy Trostianetsky, Yoni Kasten, Meirav Galun, Ronen Basri

PDF

TL;DR

GSVisLoc is a novel visual localization method that leverages 3D Gaussian Splatting scene representations to accurately estimate camera poses without retraining or additional data, demonstrating strong performance across diverse scenes.

Contribution

The paper introduces GSVisLoc, a generalizable localization approach that uses 3D Gaussian Splatting for scene representation, enabling accurate pose estimation without retraining.

Findings

01

Achieves competitive localization accuracy on standard benchmarks.

02

Outperforms existing 3DGS-based localization baselines.

03

Generalizes effectively to unseen scenes without retraining.

Abstract

We introduce GSVisLoc, a visual localization method designed for 3D Gaussian Splatting (3DGS) scene representations. Given a 3DGS model of a scene and a query image, our goal is to estimate the camera's position and orientation. We accomplish this by robustly matching scene features to image features. Scene features are produced by downsampling and encoding the 3D Gaussians while image features are obtained by encoding image patches. Our algorithm proceeds in three steps, starting with coarse matching, then fine matching, and finally by applying pose refinement for an accurate final estimate. Importantly, our method leverages the explicit 3DGS scene representation for visual localization without requiring modifications, retraining, or additional reference images. We evaluate GSVisLoc on both indoor and outdoor scenes, demonstrating competitive localization performance on standard…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.