Efficient On-the-fly Category Retrieval using ConvNets and GPUs

Ken Chatfield; Karen Simonyan; Andrew Zisserman

arXiv:1407.4764·cs.CV·November 18, 2014·2 cites

Efficient On-the-fly Category Retrieval using ConvNets and GPUs

Ken Chatfield, Karen Simonyan, Andrew Zisserman

PDF

Open Access

TL;DR

This paper demonstrates that ConvNets combined with GPU acceleration enable fast, memory-efficient, and accurate on-the-fly image and video category retrieval, outperforming previous methods in speed and precision.

Contribution

It introduces a GPU-based framework using ConvNets for real-time classifier learning and ranking, with reduced feature dimensionality and improved retrieval performance.

Findings

01

ConvNets provide highly performant, low-dimensional features for large-scale retrieval.

02

Compression techniques like product quantization and binarization further reduce feature size without performance loss.

03

The system achieves under one second retrieval and classifier training on a single GPU.

Abstract

We investigate the gains in precision and speed, that can be obtained by using Convolutional Networks (ConvNets) for on-the-fly retrieval - where classifiers are learnt at run time for a textual query from downloaded images, and used to rank large image or video datasets. We make three contributions: (i) we present an evaluation of state-of-the-art image representations for object category retrieval over standard benchmark datasets containing 1M+ images; (ii) we show that ConvNets can be used to obtain features which are incredibly performant, and yet much lower dimensional than previous state-of-the-art image representations, and that their dimensionality can be reduced further without loss in performance by compression using product quantization or binarization. Consequently, features with the state-of-the-art performance on large-scale datasets of millions of images can fit in the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Image and Video Retrieval Techniques · Domain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications

MethodsSupport Vector Machine