Search-based User Interest Modeling with Lifelong Sequential Behavior Data for Click-Through Rate Prediction
Pi Qi, Xiaoqiang Zhu, Guorui Zhou, Yujing Zhang, Zhe Wang, Lejian Ren,, Ying Fan, and Kun Gai

TL;DR
This paper introduces the Search-based Interest Model (SIM), a new approach for modeling long sequential user behavior data that improves scalability and accuracy in click-through rate prediction, demonstrated by significant industrial deployment results.
Contribution
We propose the SIM paradigm with cascaded search units to better capture user interests from extremely long behavior sequences, surpassing previous models in scalability and precision.
Findings
SIM achieves up to 54x longer sequence modeling than previous SOTA.
Deployment of SIM in Alibaba's advertising system increased CTR by 7.1%.
SIM effectively models behavior sequences up to 54,000 in length.
Abstract
Rich user behavior data has been proven to be of great value for click-through rate prediction tasks, especially in industrial applications such as recommender systems and online advertising. Both industry and academy have paid much attention to this topic and propose different approaches to modeling with long sequential user behavior data. Among them, memory network based model MIMN proposed by Alibaba, achieves SOTA with the co-design of both learning algorithm and serving system. MIMN is the first industrial solution that can model sequential user behavior data with length scaling up to 1000. However, MIMN fails to precisely capture user interests given a specific candidate item when the length of user behavior sequence increases further, say, by 10 times or more. This challenge exists widely in previously proposed approaches. In this paper, we tackle this problem by designing a new…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRecommender Systems and Techniques · Caching and Content Delivery · Web Data Mining and Analysis
MethodsMemory Network
