SIR: Similar Image Retrieval for Product Search in E-Commerce
Theban Stanley, Nihar Vanjara, Yanxin Pan, Ekaterina Pirogova, Swagata, Chakraborty, Abon Chaudhuri

TL;DR
This paper introduces SIR, a visual similarity-based image retrieval system for e-commerce, enabling quick discovery of visually similar products without relying on predefined labels, suitable for dynamic and short-lived themes.
Contribution
The paper presents a scalable SIR platform with a novel embedding and indexing approach, tailored for real-time, theme-agnostic product search in large, diverse catalogs.
Findings
Effective in detecting objectionable and trending themes rapidly.
Helps discover visual variants difficult to find via text search.
Performs well in real-world e-commerce applications.
Abstract
We present a similar image retrieval (SIR) platform that is used to quickly discover visually similar products in a catalog of millions. Given the size, diversity, and dynamism of our catalog, product search poses many challenges. It can be addressed by building supervised models to tagging product images with labels representing themes and later retrieving them by labels. This approach suffices for common and perennial themes like "white shirt" or "lifestyle image of TV". It does not work for new themes such as "e-cigarettes", hard-to-define ones such as "image with a promotional badge", or the ones with short relevance span such as "Halloween costumes". SIR is ideal for such cases because it allows us to search by an example, not a pre-defined theme. We describe the steps - embedding computation, encoding, and indexing - that power the approximate nearest neighbor search back-end. We…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
