Fine-grained Image Retrieval via Dual-Vision Adaptation
Xin Jiang, Meiqi Cao, Hao Tang, Fei Shen, Zechao Li

TL;DR
This paper introduces a Dual-Vision Adaptation method for fine-grained image retrieval that enhances generalization by collaboratively adapting samples and features while leveraging pre-trained models.
Contribution
The proposed DVA approach enables effective fine-grained image retrieval by combining sample and feature adaptation without overfitting, utilizing knowledge transfer from pre-trained models.
Findings
DVA outperforms existing methods on multiple datasets.
Fewer learnable parameters than traditional fine-tuning.
Effective in both in-distribution and out-of-distribution scenarios.
Abstract
Fine-Grained Image Retrieval~(FGIR) faces challenges in learning discriminative visual representations to retrieve images with similar fine-grained features. Current leading FGIR solutions typically follow two regimes: enforce pairwise similarity constraints in the semantic embedding space, or incorporate a localization sub-network to fine-tune the entire model. However, such two regimes tend to overfit the training data while forgetting the knowledge gained from large-scale pre-training, thus reducing their generalization ability. In this paper, we propose a Dual-Vision Adaptation (DVA) approach for FGIR, which guides the frozen pre-trained model to perform FGIR through collaborative sample and feature adaptation. Specifically, we design Object-Perceptual Adaptation, which modifies input samples to help the pre-trained model perceive critical objects and elements within objects that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image and Video Retrieval Techniques · Domain Adaptation and Few-Shot Learning · Image Retrieval and Classification Techniques
MethodsSparse Evolutionary Training · Knowledge Distillation
