Unleashing the Potential of Sparse Attention on Long-term Behaviors for CTR Prediction

Weijiang Lai; Beihong Jin; Di Zhang; Siru Chen; Jiongyan Zhang; Yuhang Gou; Jian Dong; Xingxing Wang

arXiv:2601.17836·cs.IR·January 27, 2026

Unleashing the Potential of Sparse Attention on Long-term Behaviors for CTR Prediction

Weijiang Lai, Beihong Jin, Di Zhang, Siru Chen, Jiongyan Zhang, Yuhang Gou, Jian Dong, Xingxing Wang

PDF

Open Access

TL;DR

SparseCTR is a novel model that efficiently captures long-term user behaviors using personalized sequence segmentation and a three-branch sparse self-attention mechanism, significantly improving CTR prediction in recommender systems.

Contribution

The paper introduces SparseCTR, a new approach that effectively models long-term user behaviors with personalized segmentation and a three-branch attention mechanism, addressing computational challenges.

Findings

01

Outperforms state-of-the-art methods in efficiency and accuracy

02

Maintains performance improvements across three orders of magnitude in FLOPs

03

Increases CTR by 1.72% and CPM by 1.41% in online A/B testing

Abstract

In recent years, the success of large language models (LLMs) has driven the exploration of scaling laws in recommender systems. However, models that demonstrate scaling laws are actually challenging to deploy in industrial settings for modeling long sequences of user behaviors, due to the high computational complexity of the standard self-attention mechanism. Despite various sparse self-attention mechanisms proposed in other fields, they are not fully suited for recommendation scenarios. This is because user behaviors exhibit personalization and temporal characteristics: different users have distinct behavior patterns, and these patterns change over time, with data from these users differing significantly from data in other fields in terms of distribution. To address these challenges, we propose SparseCTR, an efficient and effective model specifically designed for long-term behaviors of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRecommender Systems and Techniques · Explainable Artificial Intelligence (XAI) · Advanced Graph Neural Networks