Unleashing the Potential of Sparse Attention on Long-term Behaviors for CTR Prediction
Weijiang Lai, Beihong Jin, Di Zhang, Siru Chen, Jiongyan Zhang, Yuhang Gou, Jian Dong, Xingxing Wang

TL;DR
SparseCTR is a novel model that efficiently captures long-term user behaviors using personalized sequence segmentation and a three-branch sparse self-attention mechanism, significantly improving CTR prediction in recommender systems.
Contribution
The paper introduces SparseCTR, a new approach that effectively models long-term user behaviors with personalized segmentation and a three-branch attention mechanism, addressing computational challenges.
Findings
Outperforms state-of-the-art methods in efficiency and accuracy
Maintains performance improvements across three orders of magnitude in FLOPs
Increases CTR by 1.72% and CPM by 1.41% in online A/B testing
Abstract
In recent years, the success of large language models (LLMs) has driven the exploration of scaling laws in recommender systems. However, models that demonstrate scaling laws are actually challenging to deploy in industrial settings for modeling long sequences of user behaviors, due to the high computational complexity of the standard self-attention mechanism. Despite various sparse self-attention mechanisms proposed in other fields, they are not fully suited for recommendation scenarios. This is because user behaviors exhibit personalization and temporal characteristics: different users have distinct behavior patterns, and these patterns change over time, with data from these users differing significantly from data in other fields in terms of distribution. To address these challenges, we propose SparseCTR, an efficient and effective model specifically designed for long-term behaviors of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRecommender Systems and Techniques · Explainable Artificial Intelligence (XAI) · Advanced Graph Neural Networks
