# On Application of Learning to Rank for E-Commerce Search

**Authors:** Shubhra Kanti Karmaker Santu, Parikshit Sondhi, ChengXiang Zhai

arXiv: 1903.04263 · 2019-03-12

## TL;DR

This paper explores the application of Learning to Rank techniques to E-Commerce search, addressing unique challenges and providing practical insights for optimizing ranking models using industry data.

## Contribution

It identifies key challenges in applying LETOR to E-Commerce search and offers guidance on feature selection, feedback signal utilization, and relevance judgment collection.

## Key findings

- Popularity-based features significantly improve ranking performance.
- Query attribute sparsity impacts LETOR effectiveness and can be mitigated.
- Order rate is the most robust feedback signal for training.

## Abstract

E-Commerce (E-Com) search is an emerging important new application of information retrieval. Learning to Rank (LETOR) is a general effective strategy for optimizing search engines, and is thus also a key technology for E-Com search. While the use of LETOR for web search has been well studied, its use for E-Com search has not yet been well explored. In this paper, we discuss the practical challenges in applying learning to rank methods to E-Com search, including the challenges in feature representation, obtaining reliable relevance judgments, and optimally exploiting multiple user feedback signals such as click rates, add-to-cart ratios, order rates, and revenue. We study these new challenges using experiments on industry data sets and report several interesting findings that can provide guidance on how to optimally apply LETOR to E-Com search: First, popularity-based features defined solely on product items are very useful and LETOR methods were able to effectively optimize their combination with relevance-based features. Second, query attribute sparsity raises challenges for LETOR, and selecting features to reduce/avoid sparsity is beneficial. Third, while crowdsourcing is often useful for obtaining relevance judgments for Web search, it does not work as well for E-Com search due to difficulty in eliciting sufficiently fine grained relevance judgments. Finally, among the multiple feedback signals, the order rate is found to be the most robust training objective, followed by click rate, while add-to-cart ratio seems least robust, suggesting that an effective practical strategy may be to initially use click rates for training and gradually shift to using order rates as they become available.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1903.04263/full.md

## Figures

5 figures with captions in the complete paper: https://tomesphere.com/paper/1903.04263/full.md

## References

37 references — full list in the complete paper: https://tomesphere.com/paper/1903.04263/full.md

---
Source: https://tomesphere.com/paper/1903.04263