Dress like a Star: Retrieving Fashion Products from Videos
Noa Garcia, George Vogiatzis

TL;DR
This paper introduces a video-based fashion product retrieval system that efficiently links clothing seen in videos to purchase options, significantly reducing memory use while maintaining high accuracy.
Contribution
The work presents a novel framework combining temporal aggregation and fast indexing for efficient fashion product retrieval from videos.
Findings
Memory requirements reduced by 42.5 times
Retrieval accuracy maintained at around 90%
Effective large-scale video dataset performance
Abstract
This work proposes a system for retrieving clothing and fashion products from video content. Although films and television are the perfect showcase for fashion brands to promote their products, spectators are not always aware of where to buy the latest trends they see on screen. Here, a framework for breaking the gap between fashion products shown on videos and users is presented. By relating clothing items and video frames in an indexed database and performing frame retrieval with temporal aggregation and fast indexing techniques, we can find fashion products from videos in a simple and non-intrusive way. Experiments in a large-scale dataset conducted here show that, by using the proposed framework, memory requirements can be reduced by 42.5X with respect to linear search, whereas accuracy is maintained at around 90%.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
