Learning Unified Embedding for Apparel Recognition

Yang Song; Yuan Li; Bo Wu; Chao-Yeh Chen; Xiao Zhang; Hartwig Adam

arXiv:1707.05929·cs.CV·August 16, 2017·1 cites

Learning Unified Embedding for Apparel Recognition

Yang Song, Yuan Li, Bo Wu, Chao-Yeh Chen, Xiao Zhang, Hartwig Adam

PDF

Open Access

TL;DR

This paper proposes a unified embedding model for apparel recognition that matches the accuracy of multiple specialized models while reducing complexity, by optimizing training data selection and introducing a novel learning scheme using specialized model outputs as targets.

Contribution

It introduces a new training scheme using specialized model outputs as targets, enabling effective unified apparel recognition models without sacrificing accuracy.

Findings

01

Unified model achieves same accuracy as multiple specialized models

02

New loss function simplifies training and improves feature space utilization

03

Approach reduces model complexity and deployment costs

Abstract

In apparel recognition, specialized models (e.g. models trained for a particular vertical like dresses) can significantly outperform general models (i.e. models that cover a wide range of verticals). Therefore, deep neural network models are often trained separately for different verticals. However, using specialized models for different verticals is not scalable and expensive to deploy. This paper addresses the problem of learning one unified embedding model for multiple object verticals (e.g. all apparel classes) without sacrificing accuracy. The problem is tackled from two aspects: training data and training difficulty. On the training data aspect, we figure out that for a single model trained with triplet loss, there is an accuracy sweet spot in terms of how many verticals are trained together. To ease the training difficulty, a novel learning scheme is proposed by using the output…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsIndustrial Vision Systems and Defect Detection · Face recognition and analysis · Generative Adversarial Networks and Image Synthesis