Self-Attentive Sequential Recommendation
Wang-Cheng Kang, Julian McAuley

TL;DR
This paper introduces SASRec, a self-attention based sequential recommendation model that balances the strengths of Markov Chains and RNNs, capturing long-term user preferences efficiently across different dataset densities.
Contribution
The paper proposes SASRec, a novel self-attention model for sequential recommendation that effectively captures long-term semantics with fewer actions, outperforming existing models in accuracy and efficiency.
Findings
Outperforms state-of-the-art sequential models on various datasets.
More efficient than CNN/RNN-based models by an order of magnitude.
Effectively captures meaningful activity patterns through attention visualization.
Abstract
Sequential dynamics are a key feature of many modern recommender systems, which seek to capture the `context' of users' activities on the basis of actions they have performed recently. To capture such patterns, two approaches have proliferated: Markov Chains (MCs) and Recurrent Neural Networks (RNNs). Markov Chains assume that a user's next action can be predicted on the basis of just their last (or last few) actions, while RNNs in principle allow for longer-term semantics to be uncovered. Generally speaking, MC-based methods perform best in extremely sparse datasets, where model parsimony is critical, while RNNs perform better in denser datasets where higher model complexity is affordable. The goal of our work is to balance these two goals, by proposing a self-attention based sequential model (SASRec) that allows us to capture long-term semantics (like an RNN), but, using an attention…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRecommender Systems and Techniques · Machine Learning in Healthcare · Generative Adversarial Networks and Image Synthesis
