CRAFT: Complementary Recommendations Using Adversarial Feature Transformer
Cong Phuoc Huynh, Arridhana Ciptadi, Ambrish Tyagi, Amit Agrawal, (Amazon.com)

TL;DR
This paper introduces CRAFT, an unsupervised adversarial transformer framework that generates diverse visual recommendations for complementary fashion items, outperforming baseline methods in human evaluations.
Contribution
CRAFT is the first to use adversarial feature transformers for visual complementary item recommendation, enabling diverse and realistic suggestions without labeled data.
Findings
Recommendations are diverse and human-preferred.
The model effectively learns cross-category visual relationships.
Generated samples closely match real-world examples.
Abstract
Traditional approaches for complementary product recommendations rely on behavioral and non-visual data such as customer co-views or co-buys. However, certain domains such as fashion are primarily visual. We propose a framework that harnesses visual cues in an unsupervised manner to learn the distribution of co-occurring complementary items in real world images. Our model learns a non-linear transformation between the two manifolds of source and target complementary item categories (e.g., tops and bottoms in outfits). Given a large dataset of images containing instances of co-occurring object categories, we train a generative transformer network directly on the feature representation space by casting it as an adversarial optimization problem. Such a conditional generative model can produce multiple novel samples of complementary items (in the feature space) for a given query item. The…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Multimodal Machine Learning Applications · Handwritten Text Recognition Techniques
MethodsLinear Layer · Absolute Position Encodings · Position-Wise Feed-Forward Layer · Residual Connection · Byte Pair Encoding · Dense Connections · Label Smoothing · *Communicated@Fast*How Do I Communicate to Expedia? · Adam · Softmax
