Learning a Product Relevance Model from Click-Through Data in E-Commerce
Shaowei Yao, Jiwei Tan, Xi Chen, Keping Yang, Rong Xiao, Hongbo Deng, and Xiaojun Wan

TL;DR
This paper introduces a new framework for learning product relevance models from noisy click-through data in e-commerce, improving relevance estimation despite data ambiguity and noise.
Contribution
It proposes a novel training approach that uses fine-grained relevance confidence and a new objective to enhance robustness of relevance models from click data.
Findings
Achieves promising offline and online performance
Deployed in Taobao for over a year
Handles noisy click data effectively
Abstract
The search engine plays a fundamental role in online e-commerce systems, to help users find the products they want from the massive product collections. Relevance is an essential requirement for e-commerce search, since showing products that do not match search query intent will degrade user experience. With the existence of vocabulary gap between user language of queries and seller language of products, measuring semantic relevance is necessary and neural networks are engaged to address this task. However, semantic relevance is different from click-through rate prediction in that no direct training signal is available. Most previous attempts learn relevance models from user click-through data that are cheap and abundant. Unfortunately, click behavior is noisy and misleading, which is affected by not only relevance but also factors including price, image and attractive titles.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
