Deep Metric Learning with Soft Orthogonal Proxies
Farshad Saberi-Movahed, Mohammad K.Ebrahimpour, Farid Saberi-Movahed,, Monireh Moshavash, Dorsa Rahmatian, Mahvash Mohazzebi, Mahdi Shariatzadeh,, Mahdi Eftekhari

TL;DR
This paper introduces a novel Soft Orthogonality constraint for proxies in Deep Metric Learning, improving class separation and reducing redundancy, leading to superior image retrieval performance.
Contribution
The paper proposes a Soft Orthogonality regularization for proxies in DML, enhancing their distribution and improving retrieval accuracy.
Findings
Outperforms state-of-the-art methods on four benchmarks
Proxies become more orthogonal and less correlated
Improved convergence and retrieval accuracy
Abstract
Deep Metric Learning (DML) models rely on strong representations and similarity-based measures with specific loss functions. Proxy-based losses have shown great performance compared to pair-based losses in terms of convergence speed. However, proxies that are assigned to different classes may end up being closely located in the embedding space and hence having a hard time to distinguish between positive and negative items. Alternatively, they may become highly correlated and hence provide redundant information with the model. To address these issues, we propose a novel approach that introduces Soft Orthogonality (SO) constraint on proxies. The constraint ensures the proxies to be as orthogonal as possible and hence control their positions in the embedding space. Our approach leverages Data-Efficient Image Transformer (DeiT) as an encoder to extract contextual features from images along…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Advanced Image and Video Retrieval Techniques · Multimodal Machine Learning Applications
MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Position-Wise Feed-Forward Layer · Absolute Position Encodings · Layer Normalization · Label Smoothing · Adam · Byte Pair Encoding · Residual Connection
