TL;DR
This paper introduces a Sparse Attentive Memory network that efficiently models long user behavior sequences for click-through rate prediction, achieving high accuracy and real-time inference on large-scale e-commerce data.
Contribution
It proposes a novel SAM network that models intra-sequence and target dependencies with linear complexity, enabling practical deployment on large-scale systems.
Findings
Supports sequences of length 1000 with under 30ms inference time
Achieves 7.30% CTR improvement in online A/B testing
Effective for both long and short sequence modeling
Abstract
Sequential recommendation predicts users' next behaviors with their historical interactions. Recommending with longer sequences improves recommendation accuracy and increases the degree of personalization. As sequences get longer, existing works have not yet addressed the following two main challenges. Firstly, modeling long-range intra-sequence dependency is difficult with increasing sequence lengths. Secondly, it requires efficient memory and computational speeds. In this paper, we propose a Sparse Attentive Memory (SAM) network for long sequential user behavior modeling. SAM supports efficient training and real-time inference for user behavior sequences with lengths on the scale of thousands. In SAM, we model the target item as the query and the long sequence as the knowledge database, where the former continuously elicits relevant information from the latter. SAM simultaneously…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
