Selective Convolutional Descriptor Aggregation for Fine-Grained Image Retrieval
Xiu-Shen Wei, Jian-Hao Luo, Jianxin Wu, Zhi-Hua Zhou

TL;DR
This paper introduces SCDA, an unsupervised method that localizes objects and aggregates deep descriptors to improve fine-grained image retrieval, achieving high accuracy without requiring labels or bounding boxes.
Contribution
The paper presents a novel unsupervised approach, SCDA, for fine-grained image retrieval that localizes objects and extracts meaningful features without annotations.
Findings
SCDA outperforms existing methods on six fine-grained datasets.
SCDA features correspond to visual attributes, aiding interpretability.
Achieves comparable results to state-of-the-art on general image retrieval datasets.
Abstract
Deep convolutional neural network models pre-trained for the ImageNet classification task have been successfully adopted to tasks in other domains, such as texture description and object proposal generation, but these tasks require annotations for images in the new domain. In this paper, we focus on a novel and challenging task in the pure unsupervised setting: fine-grained image retrieval. Even with image labels, fine-grained images are difficult to classify, let alone the unsupervised retrieval task. We propose the Selective Convolutional Descriptor Aggregation (SCDA) method. SCDA firstly localizes the main object in fine-grained images, a step that discards the noisy background and keeps useful deep descriptors. The selected descriptors are then aggregated and dimensionality reduced into a short feature vector using the best practices we found. SCDA is unsupervised, using no image…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image and Video Retrieval Techniques · Image Retrieval and Classification Techniques · Advanced Neural Network Applications
