Rethinking preventing class-collapsing in metric learning with margin-based losses
Elad Levi, Tete Xiao, Xiaolong Wang, Trevor Darrell

TL;DR
This paper identifies a class collapse problem in margin-based metric learning losses and proposes a simple sampling modification that allows multiple sub-clusters per class, improving retrieval performance.
Contribution
The paper introduces a novel sampling strategy for margin-based losses that prevents class collapse and supports multiple sub-clusters within classes.
Findings
Improved retrieval accuracy on fine-grained datasets.
Qualitative evidence of better class separation in embeddings.
The method is compatible with various margin-based losses.
Abstract
Metric learning seeks perceptual embeddings where visually similar instances are close and dissimilar instances are apart, but learned representations can be sub-optimal when the distribution of intra-class samples is diverse and distinct sub-clusters are present. Although theoretically with optimal assumptions, margin-based losses such as the triplet loss and margin loss have a diverse family of solutions. We theoretically prove and empirically show that under reasonable noise assumptions, margin-based losses tend to project all samples of a class with various modes onto a single point in the embedding space, resulting in a class collapse that usually renders the space ill-sorted for classification or retrieval. To address this problem, we propose a simple modification to the embedding losses such that each sample selects its nearest same-class counterpart in a batch as the positive…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Advanced Image and Video Retrieval Techniques · AI in cancer detection
