Quantity-Aware Coarse-to-Fine Correspondence for Image-to-Point Cloud Registration
Gongxin Yao, Yixin Xuan, Yiwei Chen, Yu Pan

TL;DR
This paper introduces a quantity-aware coarse-to-fine framework for image-to-point cloud registration, improving matching accuracy by leveraging semantic alignment and adaptive correlation quantification, leading to superior performance on benchmark datasets.
Contribution
It proposes a novel quantity-aware correspondence framework with adaptive supervision and refinement strategies for improved image-to-point cloud registration.
Findings
Outperforms state-of-the-art methods on KITTI Odometry dataset.
Achieves higher registration accuracy on NuScenes dataset.
Demonstrates robustness and efficiency in correspondence matching.
Abstract
Image-to-point cloud registration aims to determine the relative camera pose between an RGB image and a reference point cloud, serving as a general solution for locating 3D objects from 2D observations. Matching individual points with pixels can be inherently ambiguous due to modality gaps. To address this challenge, we propose a framework to capture quantity-aware correspondences between local point sets and pixel patches and refine the results at both the point and pixel levels. This framework aligns the high-level semantics of point sets and pixel patches to improve the matching accuracy. On a coarse scale, the set-to-patch correspondence is expected to be influenced by the quantity of 3D points. To achieve this, a novel supervision strategy is proposed to adaptively quantify the degrees of correlation as continuous values. On a finer scale, point-to-pixel correspondences are refined…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobotics and Sensor-Based Localization · Advanced Neural Network Applications · Advanced Image and Video Retrieval Techniques
MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Byte Pair Encoding · Position-Wise Feed-Forward Layer · Softmax · Label Smoothing · Residual Connection · Absolute Position Encodings · Adam
