Sequential Recommendation via Stochastic Self-Attention
Ziwei Fan, Zhiwei Liu, Alice Wang, Zahra Nazari, Lei Zheng, Hao Peng,, Philip S. Yu

TL;DR
This paper introduces STOSA, a stochastic self-attention model for sequential recommendation that captures uncertainty in user behaviors and improves cold start item recommendations by embedding items as Gaussian distributions and using Wasserstein self-attention.
Contribution
The paper proposes a novel stochastic self-attention mechanism with Wasserstein distance to better model uncertainty and collaborative transitivity in sequential recommendation.
Findings
Outperforms state-of-the-art methods on five benchmark datasets.
Shows significant improvements on cold start items.
Effectively models uncertainty with Gaussian embeddings.
Abstract
Sequential recommendation models the dynamics of a user's previous behaviors in order to forecast the next item, and has drawn a lot of attention. Transformer-based approaches, which embed items as vectors and use dot-product self-attention to measure the relationship between items, demonstrate superior capabilities among existing sequential methods. However, users' real-world sequential behaviors are \textit{\textbf{uncertain}} rather than deterministic, posing a significant challenge to present techniques. We further suggest that dot-product-based approaches cannot fully capture \textit{\textbf{collaborative transitivity}}, which can be derived in item-item transitions inside sequences and is beneficial for cold start items. We further argue that BPR loss has no constraint on positive and sampled negative items, which misleads the optimization. We propose a novel \textbf{STO}chastic…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRecommender Systems and Techniques · Machine Learning in Healthcare · Mental Health via Writing
