Three Things to Know about Deep Metric Learning

Yash Patel; Giorgos Tolias; Jiri Matas

arXiv:2412.12432·cs.CV·December 18, 2024

Three Things to Know about Deep Metric Learning

Yash Patel, Giorgos Tolias, Jiri Matas

PDF

Open Access

TL;DR

This paper improves deep metric learning for image retrieval by proposing a differentiable loss, an efficient mixup regularization, and leveraging pre-trained models, leading to near state-of-the-art results.

Contribution

It introduces a differentiable surrogate loss, an efficient pairwise mixup regularization, and demonstrates the benefits of pre-trained model initialization for deep metric learning.

Findings

01

Nearly solves popular benchmarks with large models

02

Differentiable loss improves optimization of recall@k

03

Mixup regularization enhances model robustness

Abstract

This paper addresses supervised deep metric learning for open-set image retrieval, focusing on three key aspects: the loss function, mixup regularization, and model initialization. In deep metric learning, optimizing the retrieval evaluation metric, recall@k, via gradient descent is desirable but challenging due to its non-differentiable nature. To overcome this, we propose a differentiable surrogate loss that is computed on large batches, nearly equivalent to the entire training set. This computationally intensive process is made feasible through an implementation that bypasses the GPU memory limitations. Additionally, we introduce an efficient mixup regularization technique that operates on pairwise scalar similarities, effectively increasing the batch size even further. The training process is further enhanced by initializing the vision encoder using foundational models, which are…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications

MethodsMixup