Efficient Long Sequential User Data Modeling for Click-Through Rate Prediction
Qiwei Chen, Yue Xu, Changhua Pei, Shanshan Lv, Tao Zhuang, Junfeng Ge

TL;DR
This paper introduces ETA-Net, an end-to-end, hashing-based attention model for long user behavior sequences that improves CTR prediction accuracy and efficiency, successfully deployed in Taobao's recommender system.
Contribution
The paper presents ETA-Net, a novel end-to-end attention model with low-cost hashing for efficient long sequence modeling, and demonstrates its industrial deployment and performance gains.
Findings
ETA-Net reduces complexity of standard attention by orders of magnitude.
Deployment on Taobao's system yields 1.8% CTR lift and 3.1% GMV increase.
Model outperforms existing CTR models in accuracy and cost-efficiency.
Abstract
Recent studies on Click-Through Rate (CTR) prediction has reached new levels by modeling longer user behavior sequences. Among others, the two-stage methods stand out as the state-of-the-art (SOTA) solution for industrial applications. The two-stage methods first train a retrieval model to truncate the long behavior sequence beforehand and then use the truncated sequences to train a CTR model. However, the retrieval model and the CTR model are trained separately. So the retrieved subsequences in the CTR model is inaccurate, which degrades the final performance. In this paper, we propose an end-to-end paradigm to model long behavior sequences, which is able to achieve superior performance along with remarkable cost-efficiency compared to existing models. Our contribution is three-fold: First, we propose a hashing-based efficient target attention (TA) network named ETA-Net to enable…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCaching and Content Delivery · Advanced Computing and Algorithms · Image and Video Quality Assessment
