Learning From Weights: A Cost-Sensitive Approach For Ad Retrieval
Nikit Begwani, Shrutendra Harsola, Rahul Agrawal

TL;DR
This paper introduces a cost-sensitive weighting strategy for retrieval models that improves click-through rates and user engagement by effectively handling long-tail query-document pairs in sponsored search and product recommendation.
Contribution
It proposes a novel weighting approach to incorporate long-tail pairs in training, enhancing semantic representations without sacrificing retrieval quality.
Findings
11.8% increase in CTR in live search A/B tests
8.2% reduction in bounce rate in online experiments
Slight improvements in NDCG scores for product recommendation
Abstract
Retrieval models such as CLSM is trained on click-through data which treats each clicked query-document pair as equivalent. While training on click-through data is reasonable, this paper argues that it is sub-optimal because of its noisy and long-tail nature (especially for sponsored search). In this paper, we discuss the impact of incorporating or disregarding the long tail pairs in the training set. Also, we propose a weighing based strategy using which we can learn semantic representations for tail pairs without compromising the quality of retrieval. We conducted our experiments on Bing sponsored search and also on Amazon product recommendation to demonstrate that the methodology is domain agnostic. Online A/B testing on live search engine traffic showed improvements in clicks (11.8\% higher CTR) and as well as improvement in quality (8.2\% lower bounce rate) when compared to the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsInformation Retrieval and Search Behavior · Advanced Image and Video Retrieval Techniques · Recommender Systems and Techniques
