Rethinking Deep Contrastive Learning with Embedding Memory
Haozhi Zhang, Xun Wang, Weilin Huang, Matthew R. Scott

TL;DR
This paper introduces a systematic approach to analyze pair-wise loss functions in deep metric learning using embedding memory, emphasizing the importance of mining hard negatives and simplifying weighting schemes for improved performance.
Contribution
It proposes a new methodology for studying pair weighting strategies in memory-based deep metric learning, revealing insights that lead to a simple yet effective weighting rule.
Findings
Memory-based DML effectively mines hard negatives.
Weighting positive pairs is less beneficial than focusing on negatives.
Simple contrastive loss with memory outperforms complex mini-batch methods.
Abstract
Pair-wise loss functions have been extensively studied and shown to continuously improve the performance of deep metric learning (DML). However, they are primarily designed with intuition based on simple toy examples, and experimentally identifying the truly effective design is difficult in complicated, real-world cases. In this paper, we provide a new methodology for systematically studying weighting strategies of various pair-wise loss functions, and rethink pair weighting with an embedding memory. We delve into the weighting mechanisms by decomposing the pair-wise functions, and study positive and negative weights separately using direct weight assignment. This allows us to study various weighting functions deeply and systematically via weight curves, and identify a number of meaningful, comprehensive and insightful facts, which come up with our key observation on memory-based DML:…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image and Video Retrieval Techniques · Face recognition and analysis · Multimodal Machine Learning Applications
