PN-Net: Conjoined Triple Deep Network for Learning Local Image   Descriptors

Vassileios Balntas; Edward Johns; Lilian Tang; Krystian; Mikolajczyk

arXiv:1601.05030·cs.CV·January 20, 2016·155 cites

PN-Net: Conjoined Triple Deep Network for Learning Local Image Descriptors

Vassileios Balntas, Edward Johns, Lilian Tang, Krystian, Mikolajczyk

PDF

Open Access 1 Repo

TL;DR

This paper introduces PN-Net, a CNN-based local image descriptor that achieves high matching accuracy with low computational cost by using triplet training and a novel loss function, outperforming existing methods in speed and memory efficiency.

Contribution

PN-Net presents a new CNN-based local descriptor trained with triplets and a novel loss function, offering improved performance and efficiency over prior descriptors.

Findings

01

Outperforms MatchNet and DeepCompare in accuracy.

02

Extraction time comparable to binary descriptors like BRIEF and ORB.

03

Significantly reduced training and execution time.

Abstract

In this paper we propose a new approach for learning local descriptors for matching image patches. It has recently been demonstrated that descriptors based on convolutional neural networks (CNN) can significantly improve the matching performance. Unfortunately their computational complexity is prohibitive for any practical application. We address this problem and propose a CNN based descriptor with improved matching performance, significantly reduced training and execution time, as well as low dimensionality. We propose to train the network with triplets of patches that include a positive and negative pairs. To that end we introduce a new loss function that exploits the relations within the triplets. We compare our approach to recently introduced MatchNet and DeepCompare and demonstrate the advantages of our descriptor in terms of performance, memory footprint and speed i.e. when run…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

vbalnt/pnnet
torchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Image and Video Retrieval Techniques · Image Retrieval and Classification Techniques · Remote-Sensing Image Classification

MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings