FreeReg: Image-to-Point Cloud Registration Leveraging Pretrained   Diffusion Models and Monocular Depth Estimators

Haiping Wang; Yuan Liu; Bing Wang; Yujing Sun; Zhen Dong; Wenping; Wang; Bisheng Yang

arXiv:2310.03420·cs.CV·April 16, 2024·5 cites

FreeReg: Image-to-Point Cloud Registration Leveraging Pretrained Diffusion Models and Monocular Depth Estimators

Haiping Wang, Yuan Liu, Bing Wang, Yujing Sun, Zhen Dong, Wenping, Wang, Bisheng Yang

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces FreeReg, a novel image-to-point cloud registration method that leverages pretrained diffusion models and monocular depth estimators to establish robust cross-modality correspondences without task-specific training.

Contribution

The paper proposes using diffusion features from pretrained models and geometric features from depth maps to improve cross-modality registration accuracy without additional training.

Findings

01

Achieves 20.6% higher Inlier Ratio on benchmarks.

02

Triples the Inlier Number compared to previous methods.

03

Improves registration recall by 48.6% over state-of-the-art.

Abstract

Matching cross-modality features between images and point clouds is a fundamental problem for image-to-point cloud registration. However, due to the modality difference between images and points, it is difficult to learn robust and discriminative cross-modality features by existing metric learning methods for feature matching. Instead of applying metric learning on cross-modality data, we propose to unify the modality between images and point clouds by pretrained large-scale models first, and then establish robust correspondence within the same modality. We show that the intermediate features, called diffusion features, extracted by depth-to-image diffusion models are semantically consistent between images and point clouds, which enables the building of coarse but robust cross-modality correspondences. We further extract geometric features on depth maps produced by the monocular depth…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

WHU-USI3DV/FreeReg
pytorchOfficial

Videos

FreeReg: Image-to-Point Cloud Registration Leveraging Pretrained Diffusion Models and Monocular Depth Estimators· slideslive

Taxonomy

Topics3D Shape Modeling and Analysis · Advanced Vision and Imaging · Remote Sensing and LiDAR Applications

MethodsDiffusion