HGSLoc: 3DGS-based Heuristic Camera Pose Refinement
Zhongyan Niu, Zhen Tan, Jinpu Zhang, Xueliang Yang, Dewen Hu

TL;DR
HGSLoc is a lightweight, plug-and-play framework that combines 3D reconstruction and heuristic refinement to improve camera pose accuracy in visual localization, outperforming neural network-based methods especially in noisy and challenging environments.
Contribution
The paper introduces HGSLoc, a novel visual localization method that integrates explicit 3D geometric maps with heuristic refinement for improved accuracy and robustness.
Findings
Higher localization accuracy than NeRF-based methods
Robust performance in noisy and challenging environments
Effective in multiple benchmark datasets
Abstract
Visual localization refers to the process of determining camera poses and orientation within a known scene representation. This task is often complicated by factors such as changes in illumination and variations in viewing angles. In this paper, we propose HGSLoc, a novel lightweight plug-and-play pose optimization framework, which integrates 3D reconstruction with a heuristic refinement strategy to achieve higher pose estimation accuracy. Specifically, we introduce an explicit geometric map for 3D representation and high-fidelity rendering, allowing the generation of high-quality synthesized views to support accurate visual localization. Our method demonstrates higher localization accuracy compared to NeRF-based neural rendering localization approaches. We introduce a heuristic refinement strategy, its efficient optimization capability can quickly locate the target node, while we set…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobotics and Sensor-Based Localization · Advanced Vision and Imaging · Augmented Reality Applications
MethodsSparse Evolutionary Training · SPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings
