Embracing Structure in Data for Billion-Scale Semantic Product Search
Vihan Lakshman, Choon Hui Teo, Xiaowen Chu, Priyanka Nigam, Abhinandan, Patni, Pooja Maknikar, SVN Vishwanathan

TL;DR
This paper introduces structure-aware methods for training and deploying neural embedding models at a billion scale, improving semantic product search by leveraging dataset structure to enhance efficiency and relevance.
Contribution
It proposes a bipartite graph partitioning approach to improve training and inference efficiency for dyadic neural embeddings in large-scale semantic search.
Findings
Effective partitioning reduces search space and improves retrieval speed.
Enhanced training with hard negative mining improves model relevance.
Demonstrated success on billion-scale Amazon product data.
Abstract
We present principled approaches to train and deploy dyadic neural embedding models at the billion scale, focusing our investigation on the application of semantic product search. When training a dyadic model, one seeks to embed two different types of entities (e.g., queries and documents or users and movies) in a common vector space such that pairs with high relevance are positioned nearby. During inference, given an embedding of one type (e.g., a query or a user), one seeks to retrieve the entities of the other type (e.g., documents or movies, respectively) that are highly relevant. In this work, we show that exploiting the natural structure of real-world datasets helps address both challenges efficiently. Specifically, we model dyadic data as a bipartite graph with edges between pairs with positive associations. We then propose to partition this network into semantically coherent…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image and Video Retrieval Techniques · Domain Adaptation and Few-Shot Learning · Machine Learning and Data Classification
