A Framework to Enhance Generalization of Deep Metric Learning methods using General Discriminative Feature Learning and Class Adversarial Neural Networks
Karrar Al-Kaabi, Reza Monsefi, Davood Zabihzadeh

TL;DR
This paper proposes a novel framework that enhances the generalization ability of deep metric learning methods in zero-shot learning scenarios by using feature map attention and class adversarial networks.
Contribution
It introduces a new approach combining intermediate feature attention and class adversarial training to improve zero-shot generalization in deep metric learning.
Findings
Improved generalization on unseen categories in zero-shot learning.
Enhanced discrimination power through attention mechanisms.
Effective use of class adversarial networks for invariant feature learning.
Abstract
Metric learning algorithms aim to learn a distance function that brings the semantically similar data items together and keeps dissimilar ones at a distance. The traditional Mahalanobis distance learning is equivalent to find a linear projection. In contrast, Deep Metric Learning (DML) methods are proposed that automatically extract features from data and learn a non-linear transformation from input space to a semantically embedding space. Recently, many DML methods are proposed focused to enhance the discrimination power of the learned metric by providing novel sampling strategies or loss functions. This approach is very helpful when both the training and test examples are coming from the same set of categories. However, it is less effective in many applications of DML such as image retrieval and person-reidentification. Here, the DML should learn general semantic concepts from…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Advanced Image and Video Retrieval Techniques · Face and Expression Recognition
