Visual Search at Alibaba
Yanhao Zhang, Pan Pan, Yun Zheng, Kang Zhao, Yingya Zhang, Xiaofeng, Ren, Rong Jin

TL;DR
This paper presents Alibaba's large-scale visual search system, addressing challenges like heterogeneous data, large-scale indexing, and effective deep model training, using deep learning and system optimization for real-world commercial deployment.
Contribution
The paper introduces a comprehensive visual search system at Alibaba, including novel deep CNN models, fusion techniques, and scalable indexing methods tailored for large-scale e-commerce applications.
Findings
Effective category prediction through model and search fusion.
Deep CNN for joint detection and feature learning from user clicks.
Scalable binary indexing maintaining high recall and precision.
Abstract
This paper introduces the large scale visual search algorithm and system infrastructure at Alibaba. The following challenges are discussed under the E-commercial circumstance at Alibaba (a) how to handle heterogeneous image data and bridge the gap between real-shot images from user query and the online images. (b) how to deal with large scale indexing for massive updating data. (c) how to train deep models for effective feature representation without huge human annotations. (d) how to improve the user engagement by considering the quality of the content. We take advantage of large image collection of Alibaba and state-of-the-art deep learning techniques to perform visual search at scale. We present solutions and implementation details to overcome those problems and also share our learnings from building such a large scale commercial visual search engine. Specifically, model and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
