Coarse-to-Fine: Learning Compact Discriminative Representation for Single-Stage Image Retrieval
Yunquan Zhu, Xinkai Gao, Bo Ke, Ruizhi Qiao, Xing Sun

TL;DR
This paper introduces a Coarse-to-Fine framework for single-stage image retrieval that learns compact, discriminative global representations using adaptive loss and local descriptor selection, achieving state-of-the-art results.
Contribution
It proposes a novel end-to-end learning method combining adaptive softmax loss and local descriptor selection to improve single-stage image retrieval performance.
Findings
Achieves state-of-the-art results on Revisited Oxford and Paris datasets.
Effectively balances retrieval accuracy and efficiency.
Demonstrates robustness across various challenging scenarios.
Abstract
Image retrieval targets to find images from a database that are visually similar to the query image. Two-stage methods following retrieve-and-rerank paradigm have achieved excellent performance, but their separate local and global modules are inefficient to real-world applications. To better trade-off retrieval efficiency and accuracy, some approaches fuse global and local feature into a joint representation to perform single-stage image retrieval. However, they are still challenging due to various situations to tackle, , background, occlusion and viewpoint. In this work, we design a Coarse-to-Fine framework to learn Compact Discriminative representation (CFCD) for end-to-end single-stage image retrieval-requiring only image-level labels. Specifically, we first design a novel adaptive softmax-based loss which dynamically tunes its scale and margin within each mini-batch and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Coarse-to-Fine: Learning Compact Discriminative Representation for Single-Stage Image Retrieval· youtube
Taxonomy
TopicsAdvanced Image and Video Retrieval Techniques · Image Retrieval and Classification Techniques · Domain Adaptation and Few-Shot Learning
