Learning Fashion Compatibility from In-the-wild Images
Additya Popli, Vijay Kumar, Sujit Jos, Saraansh Tandon

TL;DR
This paper introduces a self-supervised learning approach for fashion compatibility prediction using in-the-wild street images, reducing reliance on labeled datasets and improving cross-dataset performance.
Contribution
It proposes a novel self-supervised method leveraging co-occurrence of items on the same person and an adversarial loss to bridge domain gaps between in-the-wild and catalog images.
Findings
Outperforms existing self-supervised methods on fashion compatibility benchmarks.
Significant improvements in cross-dataset compatibility prediction.
Effective domain gap reduction between in-the-wild and catalog images.
Abstract
Complementary fashion recommendation aims at identifying items from different categories (e.g. shirt, footwear, etc.) that "go well together" as an outfit. Most existing approaches learn representation for this task using labeled outfit datasets containing manually curated compatible item combinations. In this work, we propose to learn representations for compatibility prediction from in-the-wild street fashion images through self-supervised learning by leveraging the fact that people often wear compatible outfits. Our pretext task is formulated such that the representations of different items worn by the same person are closer compared to those worn by other people. Additionally, to reduce the domain gap between in-the-wild and catalog images during inference, we introduce an adversarial loss that minimizes the difference in feature distribution between the two domains. We conduct our…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · 3D Shape Modeling and Analysis · Face recognition and analysis
