IDEAL: Independent Domain Embedding Augmentation Learning

Zhiyuan Chen; Guang Yao; Wennan Ma; Lin Xu

arXiv:2105.10112 · cs.CV · May 24, 2021 · 1 cites

TL;DR

The paper introduces IDEAL, a mechanism that simultaneously learns multiple independent embedding spaces for the multiple domains generated by predefined data transformations. It can be combined with existing deep metric learning methods to improve visual retrieval performance, for example raising the Recall@1 of MS loss on Cars-196 from 84.5% to 87.1%.

Contribution

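IDEAL is a new mechanism that learns multiple independent embedding spaces, one set of domains being generated by predefined data transformations. It is orthogonal to existing deep metric learning techniques and can be combined with them without altering their loss functions.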

Findings

01. IDEAL raises the Recall@1 of MS loss on Cars-196 from 84.5% to 87.1%.

02. IDEAL achieves state-of-the-art results on the Cars-196, CUB-200, and SOP benchmarks.

03. IDEAL outperforms recent DML methods such as Circle loss and XBM.
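
The figures in these findings are Recall@1 scores. As a minimal sketch of how that metric is commonly computed in retrieval benchmarks: each item queries all the others, and the query counts as a hit when its nearest neighbor carries the same label. The Euclidean distance, the toy data, and the function name `recall_at_1` are illustrative assumptions here, not details taken from the paper.

```python
import math

def recall_at_1(embeddings, labels):
    """Fraction of queries whose nearest other item shares the query's label."""
    hits = 0
    for i, query in enumerate(embeddings):
        best_j, best_dist = None, math.inf
        for j, candidate in enumerate(embeddings):
            if j == i:
                continue  # a query never retrieves itself
            dist = math.dist(query, candidate)  # Euclidean distance (assumed)
            if dist < best_dist:
                best_j, best_dist = j, dist
        hits += labels[best_j] == labels[i]
    return hits / len(embeddings)

# Toy data: the last "b" item sits closer to the "a" cluster, so it is a miss.
toy_embeddings = [(0.0, 0.0), (0.1, 0.0), (1.0, 1.0), (1.1, 1.0), (0.2, 0.1)]
toy_labels = ["a", "a", "b", "b", "b"]
score = recall_at_1(toy_embeddings, toy_labels)
print(f"Recall@1 = {score:.1%}")
```

On this toy set four of the five queries retrieve a same-label neighbor, so the score is 80%.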

Abstract

Many efforts have been devoted to designing sampling, mining, and weighting strategies in high-level deep metric learning (DML) loss objectives. However, little attention has been paid to low-level but essential data transformation. In this paper, we develop a novel mechanism, the independent domain embedding augmentation learning (IDEAL) method. It can simultaneously learn multiple independent embedding spaces for multiple domains generated by predefined data transformations. Our IDEAL is orthogonal to existing DML techniques and can be seamlessly combined with prior DML approaches for enhanced performance. Empirical results on visual retrieval tasks demonstrate the superiority of the proposed method. For example, the IDEAL improves the performance of MS loss by a large margin, 84.5% → 87.1% on Cars-196, and 65.8% → 69.5% on CUB-200 at Recall@1. Our…
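
The core idea in the abstract, one independent embedding space per transformation-defined domain with the base loss left unchanged, can be illustrated with a small sketch. Everything concrete below is an assumption made for illustration: the three transformations, the linear heads, the contrastive stand-in loss, the plain sum over domains, and all function names are hypothetical, and the sketch omits any feature backbone. It is not the authors' implementation.

```python
import math
import random

# Hypothetical predefined transformations; each one defines a "domain".
def identity(img):
    return [row[:] for row in img]

def hflip(img):
    return [row[::-1] for row in img]

def rot180(img):
    return [row[::-1] for row in img[::-1]]

TRANSFORMS = [identity, hflip, rot180]

def make_head(in_dim, out_dim, rng):
    """One linear embedding head; every domain gets its own weights."""
    return [[rng.uniform(-1, 1) for _ in range(in_dim)] for _ in range(out_dim)]

def embed(head, img):
    flat = [p for row in img for p in row]
    vec = [sum(w * x for w, x in zip(row, flat)) for row in head]
    norm = math.sqrt(sum(v * v for v in vec)) or 1.0
    return [v / norm for v in vec]  # L2-normalized embedding

def contrastive_loss(a, b, same, margin=0.5):
    """Stand-in pairwise loss; any existing DML loss could take its place."""
    d = math.dist(a, b)
    return d * d if same else max(0.0, margin - d) ** 2

def ideal_style_loss(heads, img_a, img_b, same):
    """Sum the unchanged base loss over all domains, one head per domain."""
    total = 0.0
    for transform, head in zip(TRANSFORMS, heads):
        ea = embed(head, transform(img_a))
        eb = embed(head, transform(img_b))
        total += contrastive_loss(ea, eb, same)
    return total

rng = random.Random(0)
heads = [make_head(in_dim=4, out_dim=3, rng=rng) for _ in TRANSFORMS]
img_a = [[0.1, 0.9], [0.4, 0.3]]
img_b = [[0.2, 0.8], [0.5, 0.2]]
loss = ideal_style_loss(heads, img_a, img_b, same=True)
print(f"summed per-domain loss: {loss:.4f}")
```

Because each head only ever sees inputs from its own domain, the embedding spaces stay independent of one another, while `contrastive_loss` itself is untouched and could be swapped for another pairwise DML loss.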

Taxonomy

Topics: Domain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications · Advanced Image and Video Retrieval Techniques