PoI: A Filter to Extract Pixel of Interest from Novel Views for Scene Coordinate Regression
Feifei Li, Qi Song, Chi Zhang, Hui Shuai, Rui Huang

TL;DR
This paper introduces PoI, a framework that enhances scene coordinate regression for visual localization by refining novel view synthesis with diffusion models and a pixel filtering strategy, improving accuracy and robustness.
Contribution
It presents a novel approach combining diffusion-based view refinement and pixel filtering to improve SCR training with synthetic views, addressing limitations of existing NVS methods.
Findings
Consistently improves localization accuracy on 7Scenes and Cambridge Landmarks.
Achieves state-of-the-art performance with efficient training.
Demonstrates importance of pixel reliability control in synthetic view augmentation.
Abstract
Neural View Synthesis (NVS) techniques such as NeRF and 3D Gaussian Splatting (3DGS) have enabled photorealistic rendering from novel viewpoints and are increasingly used to augment training data for visual localization. However, these methods fundamentally rely on observed geometry and radiance; they interpolate existing information but cannot hallucinate unseen 3D structures or recover missing content under sparse or extreme viewpoints. As a result, rendered views often exhibit blur, structural distortion, or incomplete geometry. While such imperfections may be tolerated by Camera Pose Regression (CPR) methods, they severely degrade Scene Coordinate Regression (SCR), which requires accurate per-pixel 3D supervision. To address this limitation, we introduce PoI (Pixel-of-Interest), a framework that enables effective NVS augmentation for SCR-based localization. We first employ 3DGS to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImage Retrieval and Classification Techniques · Advanced Image Processing Techniques · Advanced Image and Video Retrieval Techniques
