Virtual ID Discovery from E-commerce Media at Alibaba: Exploiting   Richness of User Click Behavior for Visual Search Relevance

Yanhao Zhang; Pan Pan; Yun Zheng; Kang Zhao; Jianmin Wu; Yinghui Xu,; Rong Jin

arXiv:2102.04667·cs.CV·February 10, 2021

Virtual ID Discovery from E-commerce Media at Alibaba: Exploiting Richness of User Click Behavior for Visual Search Relevance

Yanhao Zhang, Pan Pan, Yun Zheng, Kang Zhao, Jianmin Wu, Yinghui Xu,, Rong Jin

PDF

TL;DR

This paper introduces a click-data driven approach to improve visual search relevance in e-commerce by discovering Virtual IDs from user click behavior, eliminating the need for labeled data and enhancing image distinction.

Contribution

It proposes a novel Virtual ID discovery method leveraging user click data for training deep networks, improving visual search relevance without human annotations.

Findings

01

Better distinction of real-shot images in category and features

02

Consistent improvement over state-of-the-art methods in offline and online tests

03

Effective encoding of richer supervision from click data

Abstract

Visual search plays an essential role for E-commerce. To meet the search demands of users and promote shopping experience at Alibaba, visual search relevance of real-shot images is becoming the bottleneck. Traditional visual search paradigm is usually based upon supervised learning with labeled data. However, large-scale categorical labels are required with expensive human annotations, which limits its applicability and also usually fails in distinguishing the real-shot images. In this paper, we propose to discover Virtual ID from user click behavior to improve visual search relevance at Alibaba. As a totally click-data driven approach, we collect various types of click data for training deep networks without any human annotations at all. In particular, Virtual ID are learned as classification supervision with co-click embedding, which explores image relationship from user co-click…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.