Three Things to Know about Deep Metric Learning
Yash Patel, Giorgos Tolias, Jiri Matas

TL;DR
This paper improves deep metric learning for image retrieval by proposing a differentiable loss, an efficient mixup regularization, and leveraging pre-trained models, leading to near state-of-the-art results.
Contribution
It introduces a differentiable surrogate loss, an efficient pairwise mixup regularization, and demonstrates the benefits of pre-trained model initialization for deep metric learning.
Findings
Nearly solves popular benchmarks with large models
Differentiable loss improves optimization of recall@k
Mixup regularization enhances model robustness
Abstract
This paper addresses supervised deep metric learning for open-set image retrieval, focusing on three key aspects: the loss function, mixup regularization, and model initialization. In deep metric learning, optimizing the retrieval evaluation metric, recall@k, via gradient descent is desirable but challenging due to its non-differentiable nature. To overcome this, we propose a differentiable surrogate loss that is computed on large batches, nearly equivalent to the entire training set. This computationally intensive process is made feasible through an implementation that bypasses the GPU memory limitations. Additionally, we introduce an efficient mixup regularization technique that operates on pairwise scalar similarities, effectively increasing the batch size even further. The training process is further enhanced by initializing the vision encoder using foundational models, which are…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications
MethodsMixup
