Large Scale Deep Convolutional Neural Network Features Search with   Lucene

Claudio Gennaro

arXiv:1603.09687·cs.CV·July 21, 2016·1 cites

Large Scale Deep Convolutional Neural Network Features Search with Lucene

Claudio Gennaro

PDF

Open Access

TL;DR

This paper presents a scalable method for indexing deep CNN features as text using Lucene, enabling efficient content-based image retrieval on massive datasets of around 100 million images.

Contribution

The authors introduce a novel approach to convert CNN features into textual form for indexing with Lucene, supporting large-scale image retrieval combining text and visual search.

Findings

01

Effective indexing of CNN features as text for large datasets

02

Optimized textual representation improves index size and response time

03

Prototype demonstrates retrieval on 100 million images

Abstract

In this work, we propose an approach to index Deep Convolutional Neural Network Features to support efficient content-based retrieval on large image databases. To this aim, we have converted the these features into a textual form, to index them into an inverted index by means of Lucene. In this way, we were able to set up a robust retrieval system that combines full-text search with content-based image retrieval capabilities. We evaluated different strategies of textual representation in order to optimize the index occupation and the query response time. In order to show that our approach is able to handle large datasets, we have developed a web-based prototype that provides an interface for combined textual and visual searching into a dataset of about 100 million of images.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Image and Video Retrieval Techniques · Image Retrieval and Classification Techniques · Multimodal Machine Learning Applications