Learning Fashion Compatibility with Bidirectional LSTMs

Xintong Han; Zuxuan Wu; Yu-Gang Jiang; Larry S. Davis

arXiv:1707.05691·cs.CV·July 19, 2017

TL;DR

This paper introduces a bidirectional LSTM-based model for fashion recommendation, capable of suggesting matching items and generating outfits from multimodal inputs, by learning compatibility and visual-semantic embeddings.

Contribution

It proposes an end-to-end framework that models fashion compatibility as a sequence prediction problem using Bi-LSTM and visual-semantic embedding, advancing outfit recommendation methods.

Findings

01 Outperforms alternative methods on the Polyvore dataset

02 Effectively predicts outfit compatibility

03 Generates outfits from multimodal specifications

Abstract

The ubiquity of online fashion shopping demands effective recommendation services for customers. In this paper, we study two types of fashion recommendation: (i) suggesting an item that matches existing components in a set to form a stylish outfit (a collection of fashion items), and (ii) generating an outfit with multimodal (images/text) specifications from a user. To this end, we propose to jointly learn a visual-semantic embedding and the compatibility relationships among fashion items in an end-to-end fashion. More specifically, we consider a fashion outfit to be a sequence (usually from top to bottom and then accessories) and each item in the outfit as a time step. Given the fashion items in an outfit, we train a bidirectional LSTM (Bi-LSTM) model to sequentially predict the next item conditioned on previous ones to learn their compatibility relationships. Further, we learn a…
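
The sequence view described in the abstract can be illustrated with a small sketch. This is not the authors' implementation: it uses pure Python, untrained random weights, and hypothetical names (TinyLSTM, directional_log_likelihood, compatibility_score). It also assumes that "predicting the next item" means a softmax over a small candidate pool of item embeddings, which the abstract does not specify. A forward LSTM reads the outfit in order, a second LSTM reads it in reverse, and the two log-likelihoods are summed into one compatibility score.

```python
import math
import random


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


class TinyLSTM:
    """Single-layer LSTM with random, untrained weights (illustration only)."""

    def __init__(self, dim, rng):
        self.dim = dim
        # One weight matrix per gate, applied to the concatenation [x; h].
        self.W = {
            gate: [[rng.uniform(-0.1, 0.1) for _ in range(2 * dim)]
                   for _ in range(dim)]
            for gate in ("i", "f", "o", "g")
        }

    def step(self, x, h, c):
        z = x + h  # list concatenation: [x; h]
        i = [sigmoid(dot(row, z)) for row in self.W["i"]]    # input gate
        f = [sigmoid(dot(row, z)) for row in self.W["f"]]    # forget gate
        o = [sigmoid(dot(row, z)) for row in self.W["o"]]    # output gate
        g = [math.tanh(dot(row, z)) for row in self.W["g"]]  # candidate cell
        c_new = [fv * cv + iv * gv for fv, cv, iv, gv in zip(f, c, i, g)]
        h_new = [ov * math.tanh(cv) for ov, cv in zip(o, c_new)]
        return h_new, c_new

    def run(self, xs):
        """Return the hidden state after each time step (one per item)."""
        h, c = [0.0] * self.dim, [0.0] * self.dim
        states = []
        for x in xs:
            h, c = self.step(x, h, c)
            states.append(h)
        return states


def directional_log_likelihood(lstm, items, outfit):
    """Sum over t of log P(item_{t+1} | items up to t).

    Assumption: the distribution is a softmax over all rows of `items`.
    `outfit` is a list of indices into `items`.
    """
    states = lstm.run([items[i] for i in outfit])
    total = 0.0
    for t in range(len(outfit) - 1):
        scores = [dot(states[t], cand) for cand in items]
        m = max(scores)  # subtract the max for numerical stability
        log_z = m + math.log(sum(math.exp(s - m) for s in scores))
        total += scores[outfit[t + 1]] - log_z
    return total


def compatibility_score(fwd, bwd, items, outfit):
    """Forward pass predicts next items; backward pass predicts previous ones."""
    return (directional_log_likelihood(fwd, items, outfit)
            + directional_log_likelihood(bwd, items, outfit[::-1]))


rng = random.Random(0)
dim = 4
# Hypothetical pool of six item embeddings (e.g. top, bottom, shoes, ...).
items = [[rng.uniform(-1.0, 1.0) for _ in range(dim)] for _ in range(6)]
fwd, bwd = TinyLSTM(dim, rng), TinyLSTM(dim, rng)
score = compatibility_score(fwd, bwd, items, [0, 2, 5])
print(score)
```

With trained weights, a higher (less negative) score would indicate a more compatible outfit, and the same per-step softmax could rank candidates for a missing item. Here the weights are random, so the number printed carries no meaning beyond showing the computation.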

Taxonomy

Methods: Sigmoid Activation · Tanh Activation · Long Short-Term Memory