TL;DR
This paper introduces an unsupervised method that uses probabilistic topic models to discover and organize fashion styles from unlabeled images, enabling style-based retrieval and summarization.
Contribution
The paper presents an unsupervised approach that uses polylingual topic models over visual attributes to discover latent style factors in fashion images.
Findings
Organizes galleries of outfits by style, with experiments on over 100K images
Supports style-based retrieval and outfit summarization
Requires no style labels
Abstract
What defines a visual style? Fashion styles emerge organically from how people assemble outfits of clothing, making them difficult to pin down with a computational model. Low-level visual similarity can be too specific to detect stylistically similar images, while manually crafted style categories can be too abstract to capture subtle style differences. We propose an unsupervised approach to learn a style-coherent representation. Our method leverages probabilistic polylingual topic models based on visual attributes to discover a set of latent style factors. Given a collection of unlabeled fashion images, our approach mines for the latent styles, then summarizes outfits by how they mix those styles. Our approach can organize galleries of outfits by style without requiring any style labels. Experiments on over 100K images demonstrate its promise for retrieving, mixing, and summarizing…
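To make the idea of "mining latent styles from attributes" concrete, here is a minimal sketch, not the authors' implementation. It assumes each outfit has already been reduced to a bag of predicted visual-attribute tokens, and it swaps in plain LDA with collapsed Gibbs sampling for the polylingual topic model described in the abstract. The function name `fit_lda`, the toy attribute vocabulary, and all hyperparameter values are hypothetical.

```python
import random


def fit_lda(docs, n_topics, n_iters=200, alpha=0.1, beta=0.01, seed=0):
    """Plain LDA via collapsed Gibbs sampling (an assumed stand-in for the
    paper's polylingual topic model). Each doc is a list of attribute tokens."""
    rng = random.Random(seed)
    vocab = sorted({w for doc in docs for w in doc})
    word_id = {w: i for i, w in enumerate(vocab)}
    n_words = len(vocab)

    doc_topic = [[0] * n_topics for _ in docs]             # tokens per (outfit, style)
    topic_word = [[0] * n_words for _ in range(n_topics)]  # tokens per (style, attribute)
    topic_total = [0] * n_topics                           # tokens per style
    assignments = []

    # Random initial style assignment for every attribute token.
    for d, doc in enumerate(docs):
        z_d = []
        for w in doc:
            k = rng.randrange(n_topics)
            z_d.append(k)
            doc_topic[d][k] += 1
            topic_word[k][word_id[w]] += 1
            topic_total[k] += 1
        assignments.append(z_d)

    for _ in range(n_iters):
        for d, doc in enumerate(docs):
            for i, w in enumerate(doc):
                v = word_id[w]
                k = assignments[d][i]
                # Remove the token's current assignment from the counts.
                doc_topic[d][k] -= 1
                topic_word[k][v] -= 1
                topic_total[k] -= 1
                # Conditional probability of each style for this token.
                weights = [
                    (doc_topic[d][t] + alpha)
                    * (topic_word[t][v] + beta)
                    / (topic_total[t] + n_words * beta)
                    for t in range(n_topics)
                ]
                k = rng.choices(range(n_topics), weights=weights)[0]
                assignments[d][i] = k
                doc_topic[d][k] += 1
                topic_word[k][v] += 1
                topic_total[k] += 1

    # Smoothed per-outfit style mixture: how each outfit mixes the styles.
    theta = [
        [(c + alpha) / (len(doc) + n_topics * alpha) for c in row]
        for row, doc in zip(doc_topic, docs)
    ]
    return theta, topic_word, vocab


# Toy outfits described by hypothetical attribute tokens.
outfits = [
    ["floral", "pastel", "lace", "skirt"],
    ["lace", "floral", "dress", "pastel"],
    ["leather", "black", "studs", "boots"],
    ["black", "boots", "leather", "jacket"],
    ["floral", "leather", "boots", "skirt"],
]

theta, topic_word, vocab = fit_lda(outfits, n_topics=2)
for outfit, mix in zip(outfits, theta):
    print(outfit, [round(p, 2) for p in mix])
```

Each row of `theta` is one outfit's mixture over the discovered styles, which is the kind of summary the abstract refers to. Under the same assumptions, style-based retrieval could rank outfits by the similarity of their mixture vectors rather than by pixel-level similarity.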
