PixSelect: Less but Reliable Pixels for Accurate and Efficient Localization
Mohammad Altillawi

TL;DR
PixSelect introduces a sparse pixel selection approach that filters out unreliable image regions, leading to more accurate and efficient camera localization from a single RGB image, outperforming state-of-the-art methods.
Contribution
The paper proposes a novel sparse pixel selection method that improves localization accuracy and efficiency by excluding non-discriminative regions, acting as an outlier filter.
Findings
Outperforms state-of-the-art on Cambridge Landmarks dataset.
Achieves higher accuracy with fewer correspondences.
Runs faster than methods using pose priors or dense features.
Abstract
Accurate camera pose estimation is a fundamental requirement for numerous applications, such as autonomous driving, mobile robotics, and augmented reality. In this work, we address the problem of estimating the global 6 DoF camera pose from a single RGB image in a given environment. Previous works consider every part of the image valuable for localization. However, many image regions such as the sky, occlusions, and repetitive non-distinguishable patterns cannot be utilized for localization. In addition to adding unnecessary computation efforts, extracting and matching features from such regions produce many wrong matches which in turn degrades the localization accuracy and efficiency. Our work addresses this particular issue and shows by exploiting an interesting concept of sparse 3D models that we can exploit discriminatory environment parts and avoid useless image regions for the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobotics and Sensor-Based Localization · Advanced Vision and Imaging · Advanced Image and Video Retrieval Techniques
