Quantity-Aware Coarse-to-Fine Correspondence for Image-to-Point Cloud   Registration

Gongxin Yao; Yixin Xuan; Yiwei Chen; Yu Pan

arXiv:2307.07142·cs.CV·January 19, 2024·2 cites

Quantity-Aware Coarse-to-Fine Correspondence for Image-to-Point Cloud Registration

Gongxin Yao, Yixin Xuan, Yiwei Chen, Yu Pan

PDF

Open Access

TL;DR

This paper introduces a quantity-aware coarse-to-fine framework for image-to-point cloud registration, improving matching accuracy by leveraging semantic alignment and adaptive correlation quantification, leading to superior performance on benchmark datasets.

Contribution

It proposes a novel quantity-aware correspondence framework with adaptive supervision and refinement strategies for improved image-to-point cloud registration.

Findings

01

Outperforms state-of-the-art methods on KITTI Odometry dataset.

02

Achieves higher registration accuracy on NuScenes dataset.

03

Demonstrates robustness and efficiency in correspondence matching.

Abstract

Image-to-point cloud registration aims to determine the relative camera pose between an RGB image and a reference point cloud, serving as a general solution for locating 3D objects from 2D observations. Matching individual points with pixels can be inherently ambiguous due to modality gaps. To address this challenge, we propose a framework to capture quantity-aware correspondences between local point sets and pixel patches and refine the results at both the point and pixel levels. This framework aligns the high-level semantics of point sets and pixel patches to improve the matching accuracy. On a coarse scale, the set-to-patch correspondence is expected to be influenced by the quantity of 3D points. To achieve this, a novel supervision strategy is proposed to adaptively quantify the degrees of correlation as continuous values. On a finer scale, point-to-pixel correspondences are refined…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRobotics and Sensor-Based Localization · Advanced Neural Network Applications · Advanced Image and Video Retrieval Techniques

MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Byte Pair Encoding · Position-Wise Feed-Forward Layer · Softmax · Label Smoothing · Residual Connection · Absolute Position Encodings · Adam