Efficient Sparse-to-Dense Visual Localization via Compact Gaussian Scene Representation and Accelerated Dense Pose Estimation
Zizhuo Li, Songchu Deng, Linfeng Tang, and Jiayi Ma

TL;DR
LiteLoc is an efficient visual localization method that reduces memory and computation by removing redundant features and condensing matches, achieving faster performance without sacrificing accuracy.
Contribution
The paper introduces a color-free, compact Gaussian scene representation and a match condensing strategy, significantly improving efficiency over previous methods.
Findings
Eliminates approximately 94% of redundant storage without losing localization accuracy.
Achieves nearly 19-fold speedup in dense pose estimation.
Outperforms previous state-of-the-art in multiple scenes with enhanced efficiency.
Abstract
This letter presents LiteLoc, a novel and efficient localizer built on 3D Gaussian Splatting (3DGS). The previous state-of-the-art (SoTA) sparse-to-dense localizer, STDLoc, has shown remarkable localization capability but suffers from severe storage redundancy and computational latency. By revisiting its design decisions, we derive two simple yet highly effective improvements that cumulatively make LiteLoc much more efficient in both memory and computation, while also being easier to train. One key observation is that the color field, inherited directly from Feature 3DGS, is functionally useless for localization. Yet, its reconstruction of high-frequency photometric details necessitates excessive Gaussian primitives, resulting in a tightly coupled color-feature representation with significant memory overhead and sub-optimal feature field optimization. To resolve this, we propose a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
