Deep Fishing: Gradient Features from Deep Nets

Albert Gordo; Adrien Gaidon; Florent Perronnin

arXiv:1507.06429·cs.CV·July 24, 2015·2 cites

Deep Fishing: Gradient Features from Deep Nets

Albert Gordo, Adrien Gaidon, Florent Perronnin

PDF

Open Access

TL;DR

This paper introduces a novel method that derives gradient-based features from convolutional networks, bridging deep learning and traditional Fisher Vector encoding, leading to improved image recognition performance.

Contribution

It presents a new way to extract gradient features from ConvNets, connecting deep learning with Fisher Vector encoding for enhanced image recognition.

Findings

01

Gradient features from ConvNets improve recognition accuracy.

02

Structured matrix representation enables efficient similarity computation.

03

Consistent performance gains on Pascal VOC datasets.

Abstract

Convolutional Networks (ConvNets) have recently improved image recognition performance thanks to end-to-end learning of deep feed-forward models from raw pixels. Deep learning is a marked departure from the previous state of the art, the Fisher Vector (FV), which relied on gradient-based encoding of local hand-crafted features. In this paper, we discuss a novel connection between these two approaches. First, we show that one can derive gradient representations from ConvNets in a similar fashion to the FV. Second, we show that this gradient representation actually corresponds to a structured matrix that allows for efficient similarity computation. We experimentally study the benefits of transferring this representation over the outputs of ConvNet layers, and find consistent improvements on the Pascal VOC 2007 and 2012 datasets.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Advanced Image and Video Retrieval Techniques