AliExpress Learning-To-Rank: Maximizing Online Model Performance without Going Online
Guangda Huzhang, Zhen-Jia Pang, Yongqing Gao, Yawen Liu, Weijie Shen,, Wen-Ji Zhou, Qing Da, An-Xiang Zeng, Han Yu, and Yang Yu, and Zhi-Hua Zhou

TL;DR
This paper introduces an evaluator-generator framework for learning-to-rank in e-commerce that improves online performance by addressing offline-online evaluation inconsistencies, achieving over 2% CR improvement.
Contribution
It proposes a novel evaluator-generator framework incorporating item context and reinforcement learning to enhance online ranking performance without online training.
Findings
Offline metrics can be misleading for online performance.
The proposed evaluator score aligns better with online results.
Achieved over 2% increase in Conversion Rate in online A/B tests.
Abstract
Learning-to-rank (LTR) has become a key technology in E-commerce applications. Most existing LTR approaches follow a supervised learning paradigm from offline labeled data collected from the online system. However, it has been noticed that previous LTR models can have a good validation performance over offline validation data but have a poor online performance, and vice versa, which implies a possible large inconsistency between the offline and online evaluation. We investigate and confirm in this paper that such inconsistency exists and can have a significant impact on AliExpress Search. Reasons for the inconsistency include the ignorance of item context during the learning, and the offline data set is insufficient for learning the context. Therefore, this paper proposes an evaluator-generator framework for LTR with item context. The framework consists of an evaluator that generalizes…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsOptimization and Search Problems · Advanced Bandit Algorithms Research · Recommender Systems and Techniques
