IDEAL: Independent Domain Embedding Augmentation Learning
Zhiyuan Chen, Guang Yao, Wennan Ma, Lin Xu

TL;DR
The paper introduces IDEAL, a novel data transformation technique that learns multiple independent embedding spaces for different data domains, significantly improving visual retrieval performance when combined with existing deep metric learning methods.
Contribution
IDEAL is a new mechanism that learns multiple independent embedding spaces for data transformations, enhancing deep metric learning without altering existing loss functions.
Findings
IDEAL improves MS loss performance on Cars-196 from 84.5% to 87.1%.
IDEAL achieves state-of-the-art results on Cars-196, CUB-200, and SOP benchmarks.
IDEAL outperforms recent DML methods like Circle loss and XBM.
Abstract
Many efforts have been devoted to designing sampling, mining, and weighting strategies in high-level deep metric learning (DML) loss objectives. However, little attention has been paid to low-level but essential data transformation. In this paper, we develop a novel mechanism, the independent domain embedding augmentation learning ({IDEAL}) method. It can simultaneously learn multiple independent embedding spaces for multiple domains generated by predefined data transformations. Our IDEAL is orthogonal to existing DML techniques and can be seamlessly combined with prior DML approaches for enhanced performance. Empirical results on visual retrieval tasks demonstrate the superiority of the proposed method. For example, the IDEAL improves the performance of MS loss by a large margin, 84.5\% 87.1\% on Cars-196, and 65.8\% 69.5\% on CUB-200 at Recall. Our…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications · Advanced Image and Video Retrieval Techniques
