Efficient On-the-fly Category Retrieval using ConvNets and GPUs
Ken Chatfield, Karen Simonyan, Andrew Zisserman

TL;DR
This paper demonstrates that ConvNets combined with GPU acceleration enable fast, memory-efficient, and accurate on-the-fly image and video category retrieval, outperforming previous methods in speed and precision.
Contribution
It introduces a GPU-based framework using ConvNets for real-time classifier learning and ranking, with reduced feature dimensionality and improved retrieval performance.
Findings
ConvNets provide highly performant, low-dimensional features for large-scale retrieval.
Compression techniques like product quantization and binarization further reduce feature size without performance loss.
The system achieves under one second retrieval and classifier training on a single GPU.
Abstract
We investigate the gains in precision and speed, that can be obtained by using Convolutional Networks (ConvNets) for on-the-fly retrieval - where classifiers are learnt at run time for a textual query from downloaded images, and used to rank large image or video datasets. We make three contributions: (i) we present an evaluation of state-of-the-art image representations for object category retrieval over standard benchmark datasets containing 1M+ images; (ii) we show that ConvNets can be used to obtain features which are incredibly performant, and yet much lower dimensional than previous state-of-the-art image representations, and that their dimensionality can be reduced further without loss in performance by compression using product quantization or binarization. Consequently, features with the state-of-the-art performance on large-scale datasets of millions of images can fit in the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image and Video Retrieval Techniques · Domain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications
MethodsSupport Vector Machine
