Optimization of Rank Losses for Image Retrieval
Elias Ramzi, Nicolas Audebert, Cl\'ement Rambour, Andr\'e Araujo,, Xavier Bitot, Nicolas Thome

TL;DR
This paper introduces a novel framework for optimizing rank-based loss functions in image retrieval, addressing non-differentiability and non-decomposability issues to improve training robustness and performance.
Contribution
It proposes SupRank, a surrogate for ranking operators, and a loss to bridge the decomposability gap, enabling end-to-end training for ranking metrics like AP and R@k.
Findings
SupRank provides a robust upper bound for rank losses.
The framework improves optimization of AP and R@k metrics.
A new hierarchical landmarks dataset was created for evaluation.
Abstract
In image retrieval, standard evaluation metrics rely on score ranking, \eg average precision (AP), recall at k (R@k), normalized discounted cumulative gain (NDCG). In this work we introduce a general framework for robust and decomposable rank losses optimization. It addresses two major challenges for end-to-end training of deep neural networks with rank losses: non-differentiability and non-decomposability. Firstly we propose a general surrogate for ranking operator, SupRank, that is amenable to stochastic gradient descent. It provides an upperbound for rank losses and ensures robust training. Secondly, we use a simple yet effective loss function to reduce the decomposability gap between the averaged batch approximation of ranking losses and their values on the whole training set. We apply our framework to two standard metrics for image retrieval: AP and R@k. Additionally we apply our…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image and Video Retrieval Techniques · Domain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications
