Fracking Deep Convolutional Image Descriptors

Edgar Simo-Serra; Eduard Trulls; Luis Ferraz; Iasonas; Kokkinos; Francesc Moreno-Noguer

arXiv:1412.6537·cs.CV·February 27, 2015·63 cites

Fracking Deep Convolutional Image Descriptors

Edgar Simo-Serra, Eduard Trulls, Luis Ferraz, Iasonas, Kokkinos, Francesc Moreno-Noguer

PDF

Open Access

TL;DR

This paper introduces a discriminative deep learning framework for local image descriptors using a siamese CNN architecture with a novel training strategy called 'fracking', achieving significant performance improvements over traditional and existing learned descriptors.

Contribution

It presents a new training approach for siamese CNNs with aggressive mining, leading to superior local image descriptor performance.

Findings

01

Achieves up to 2.5x better than SIFT in AUC

02

Outperforms previous state-of-the-art learned descriptors

03

Demonstrates large performance gains through hyper-parameter tuning

Abstract

In this paper we propose a novel framework for learning local image descriptors in a discriminative manner. For this purpose we explore a siamese architecture of Deep Convolutional Neural Networks (CNN), with a Hinge embedding loss on the L2 distance between descriptors. Since a siamese architecture uses pairs rather than single image patches to train, there exist a large number of positive samples and an exponential number of negative samples. We propose to explore this space with a stochastic sampling of the training set, in combination with an aggressive mining strategy over both the positive and negative samples which we denote as "fracking". We perform a thorough evaluation of the architecture hyper-parameters, and demonstrate large performance gains compared to both standard CNN learning strategies, hand-crafted image descriptors like SIFT, and the state-of-the-art on learned…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Image and Video Retrieval Techniques · Image Processing Techniques and Applications · Image and Object Detection Techniques