Exploiting Local Features from Deep Networks for Image Retrieval
Joe Yue-Hei Ng, Fan Yang, Larry S. Davis

TL;DR
This paper explores how features from different layers of deep CNNs can be used for image retrieval, showing that intermediate layers often outperform the last layers, leading to improved retrieval performance.
Contribution
It introduces a method to extract and encode features from various CNN layers for image retrieval, demonstrating the effectiveness of intermediate layers over the last layer.
Findings
Intermediate CNN layers yield better retrieval results than last layers.
Using VLAD encoding with features from different layers improves performance.
State-of-the-art results achieved with 128-D VLAD descriptors on multiple datasets.
Abstract
Deep convolutional neural networks have been successfully applied to image classification tasks. When these same networks have been applied to image retrieval, the assumption has been made that the last layers would give the best performance, as they do in classification. We show that for instance-level image retrieval, lower layers often perform better than the last layers in convolutional neural networks. We present an approach for extracting convolutional features from different layers of the networks, and adopt VLAD encoding to encode features into a single vector for each image. We investigate the effect of different layers and scales of input images on the performance of convolutional features using the recent deep networks OxfordNet and GoogLeNet. Experiments demonstrate that intermediate layers or higher layers with finer scales produce better results for image retrieval,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image and Video Retrieval Techniques · Image Retrieval and Classification Techniques · Domain Adaptation and Few-Shot Learning
Methods1x1 Convolution · Convolution · Average Pooling · Local Response Normalization · Auxiliary Classifier · Inception Module · *Communicated@Fast*How Do I Communicate to Expedia? · Dropout · Dense Connections · Max Pooling
