P2-Net: Joint Description and Detection of Local Features for Pixel and   Point Matching

Bing Wang; Changhao Chen; Zhaopeng Cui; Jie Qin; Chris Xiaoxuan Lu,; Zhengdi Yu; Peijun Zhao; Zhen Dong; Fan Zhu; Niki Trigoni; Andrew Markham

arXiv:2103.01055·cs.CV·July 30, 2021

P2-Net: Joint Description and Detection of Local Features for Pixel and Point Matching

Bing Wang, Changhao Chen, Zhaopeng Cui, Jie Qin, Chris Xiaoxuan Lu,, Zhengdi Yu, Peijun Zhao, Zhen Dong, Fan Zhu, Niki Trigoni, Andrew Markham

PDF

Open Access 1 Repo

TL;DR

This paper introduces P2-Net, a dual framework that jointly describes and detects local features for pixel and point matching, enabling accurate cross-modal correspondences between 2D images and 3D point clouds.

Contribution

It proposes a shared latent space for 2D and 3D features, along with an ultra-wide reception mechanism and novel loss function for improved matching.

Findings

01

Achieves state-of-the-art indoor visual localization results.

02

Demonstrates competitive performance in fine-grained image and point cloud matching.

03

Validates effectiveness through extensive experiments.

Abstract

Accurately describing and detecting 2D and 3D keypoints is crucial to establishing correspondences across images and point clouds. Despite a plethora of learning-based 2D or 3D local feature descriptors and detectors having been proposed, the derivation of a shared descriptor and joint keypoint detector that directly matches pixels and points remains under-explored by the community. This work takes the initiative to establish fine-grained correspondences between 2D images and 3D point clouds. In order to directly match pixels and points, a dual fully convolutional framework is presented that maps 2D and 3D inputs into a shared latent representation space to simultaneously describe and detect keypoints. Furthermore, an ultra-wide reception mechanism in combination with a novel loss function are designed to mitigate the intrinsic information variations between pixel and point local…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

BingCS/P2-Net
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRobotics and Sensor-Based Localization · Advanced Image and Video Retrieval Techniques · 3D Surveying and Cultural Heritage