No Fuss Distance Metric Learning using Proxies
Yair Movshovitz-Attias, Alexander Toshev, Thomas K. Leung, Sergey, Ioffe, Saurabh Singh

TL;DR
This paper introduces a proxy-based approach to distance metric learning that simplifies optimization, accelerates convergence, and improves performance on zero-shot learning tasks by using learned proxies to approximate data points.
Contribution
The paper proposes a novel proxy-based triplet loss that enhances optimization efficiency and achieves state-of-the-art results in zero-shot learning.
Findings
Up to 15% improvement on zero-shot learning datasets
Converges three times faster than traditional triplet methods
Empirically tighter upper bound of the original loss
Abstract
We address the problem of distance metric learning (DML), defined as learning a distance consistent with a notion of semantic similarity. Traditionally, for this problem supervision is expressed in the form of sets of points that follow an ordinal relationship -- an anchor point is similar to a set of positive points , and dissimilar to a set of negative points , and a loss defined over these distances is minimized. While the specifics of the optimization differ, in this work we collectively call this type of supervision Triplets and all methods that follow this pattern Triplet-Based methods. These methods are challenging to optimize. A main issue is the need for finding informative triplets, which is usually achieved by a variety of tricks such as increasing the batch size, hard or semi-hard triplet mining, etc. Even with these tricks, the convergence rate of such methods is…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications · Advanced Image and Video Retrieval Techniques
