Learning Visual Clothing Style with Heterogeneous Dyadic Co-occurrences

Andreas Veit; Balazs Kovacs; Sean Bell; Julian McAuley; Kavita Bala,; Serge Belongie

arXiv:1509.07473·cs.CV·September 25, 2015·41 cites

Learning Visual Clothing Style with Heterogeneous Dyadic Co-occurrences

Andreas Veit, Balazs Kovacs, Sean Bell, Julian McAuley, Kavita Bala,, Serge Belongie

PDF

Open Access

TL;DR

This paper introduces a novel framework using Siamese CNNs trained on large-scale co-occurrence data to learn visual compatibility across clothing categories, enabling the generation of matching outfits.

Contribution

It proposes a new method for learning cross-category visual compatibility using heterogeneous dyadic co-occurrences and a strategic sampling approach.

Findings

01

The framework effectively captures semantic visual style information.

02

It can generate compatible clothing outfits from different categories.

03

The approach outperforms baseline methods in compatibility prediction.

Abstract

With the rapid proliferation of smart mobile devices, users now take millions of photos every day. These include large numbers of clothing and accessory images. We would like to answer questions like `What outfit goes well with this pair of shoes?' To answer these types of questions, one has to go beyond learning visual similarity and learn a visual notion of compatibility across categories. In this paper, we propose a novel learning framework to help answer these types of questions. The main idea of this framework is to learn a feature transformation from images of items into a latent space that expresses compatibility. For the feature transformation, we use a Siamese Convolutional Neural Network (CNN) architecture, where training examples are pairs of items that are either compatible or incompatible. We model compatibility based on co-occurrence in large-scale user behavior data; in…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Visual Attention and Saliency Detection · Human Pose and Action Recognition