HGSLoc: 3DGS-based Heuristic Camera Pose Refinement

Zhongyan Niu; Zhen Tan; Jinpu Zhang; Xueliang Yang; Dewen Hu

arXiv:2409.10925 · cs.CV · September 9, 2025

Open Access

TL;DR

HGSLoc is a lightweight, plug-and-play framework that combines 3D reconstruction with heuristic refinement to improve camera pose accuracy in visual localization, outperforming neural network-based methods, especially in noisy and challenging environments.

Contribution

The paper introduces HGSLoc, a novel visual localization method that integrates explicit 3D geometric maps with heuristic refinement for improved accuracy and robustness.

Findings

01 Higher localization accuracy than NeRF-based methods

02 Robust performance in noisy and challenging environments

03 Effective on multiple benchmark datasets

Abstract

Visual localization refers to the process of determining camera position and orientation within a known scene representation. This task is often complicated by factors such as changes in illumination and variations in viewing angles. In this paper, we propose HGSLoc, a novel lightweight plug-and-play pose optimization framework, which integrates 3D reconstruction with a heuristic refinement strategy to achieve higher pose estimation accuracy. Specifically, we introduce an explicit geometric map for 3D representation and high-fidelity rendering, allowing the generation of high-quality synthesized views to support accurate visual localization. Our method demonstrates higher localization accuracy than NeRF-based neural rendering localization approaches. We introduce a heuristic refinement strategy whose efficient optimization capability can quickly locate the target node, while we set…
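The abstract does not spell out the heuristic refinement, so the following is only a rough sketch of what a heuristic search toward a "target node" might look like, not the authors' implementation. It assumes a greedy best-first search over a discretized grid of pose offsets. The names `refine_pose`, `cost_fn`, `step`, and `toy_cost` are hypothetical, and the cost function is a toy stand-in for the discrepancy between a rendered view and the query image.

```python
import heapq
import math


def refine_pose(initial_pose, cost_fn, step=0.05, max_expansions=2000, tol=1e-3):
    """Greedy best-first search over a grid of offsets around initial_pose.

    initial_pose: tuple of floats (for example x, y, z or a 6-DoF vector).
    cost_fn: hypothetical error measure; in a real system it would compare
             a view rendered at the candidate pose with the query image.
    """
    dims = len(initial_pose)
    start = (0,) * dims  # integer grid offsets from the initial pose

    def to_pose(node):
        return tuple(p + step * k for p, k in zip(initial_pose, node))

    best_node, best_cost = start, cost_fn(to_pose(start))
    frontier = [(best_cost, start)]  # min-heap ordered by cost
    visited = {start}
    expansions = 0

    while frontier and expansions < max_expansions:
        cost, node = heapq.heappop(frontier)
        expansions += 1
        if cost < best_cost:
            best_node, best_cost = node, cost
        if best_cost <= tol:
            break  # close enough to the target node
        # Expand the neighbours one grid step away along each axis.
        for axis in range(dims):
            for delta in (-1, 1):
                nb = node[:axis] + (node[axis] + delta,) + node[axis + 1:]
                if nb not in visited:
                    visited.add(nb)
                    heapq.heappush(frontier, (cost_fn(to_pose(nb)), nb))

    return to_pose(best_node), best_cost


# Toy usage: the "true" pose is known here only so the cost can be computed.
true_pose = (1.0, 2.0, 0.5)


def toy_cost(pose):
    return math.dist(pose, true_pose)


refined, err = refine_pose((1.1, 1.9, 0.6), toy_cost)
```

In this toy setting the coarse initial estimate is moved step by step toward the pose with the lowest cost. A real pipeline would replace `toy_cost` with an image-based error computed from rendered views.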

Taxonomy

Topics: Robotics and Sensor-Based Localization · Advanced Vision and Imaging · Augmented Reality Applications

Methods: Sparse Evolutionary Training · SPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings