Fast Interactive Image Retrieval using large-scale unlabeled data
Akshay Mehra, Jihun Hamm, Mikhail Belkin

TL;DR
This paper introduces a scalable, interactive image retrieval system that combines active learning and graph-based semi-supervised learning to efficiently identify relevant images with minimal user feedback in large datasets.
Contribution
It proposes a novel scalable GSSL approach integrated with active learning and multi-modal data to improve interactive image retrieval performance.
Findings
High F1 scores achieved with few feedback rounds
Scalable GSSL reduces complexity from O(n^3) to O(n)
Effective on large datasets like Imagenet with 1.2 million images
Abstract
An interactive image retrieval system learns which images in the database belong to a user's query concept, by analyzing the example images and feedback provided by the user. The challenge is to retrieve the relevant images with minimal user interaction. In this work, we propose to solve this problem by posing it as a binary classification task of classifying all images in the database as being relevant or irrelevant to the user's query concept. Our method combines active learning with graph-based semi-supervised learning (GSSL) to tackle this problem. Active learning reduces the number of user interactions by querying the labels of the most informative points and GSSL allows to use abundant unlabeled data along with the limited labeled data provided by the user. To efficiently find the most informative point, we use an uncertainty sampling based method that queries the label of the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image and Video Retrieval Techniques · Machine Learning and Algorithms · Image Retrieval and Classification Techniques
