Modeling Sequences as Distributions with Uncertainty for Sequential Recommendation
Ziwei Fan, Zhiwei Liu, Lei Zheng, Shen Wang, Philip S. Yu

TL;DR
This paper introduces DT4SR, a distribution-based Transformer model that incorporates uncertainty into sequential recommendation, effectively capturing stochastic user behaviors and alleviating cold-start problems.
Contribution
It proposes a novel distribution-based Transformer using Elliptical Gaussian distributions and Wasserstein distance for uncertainty modeling in sequential recommendation.
Findings
Outperforms state-of-the-art methods on benchmark datasets
Effectively alleviates cold-start issues
Demonstrates the importance of modeling uncertainty in sequences
Abstract
The sequential patterns within the user interactions are pivotal for representing the user's preference and capturing latent relationships among items. The recent advancements of sequence modeling by Transformers advocate the community to devise more effective encoders for the sequential recommendation. Most existing sequential methods assume users are deterministic. However, item-item transitions might fluctuate significantly in several item aspects and exhibit randomness of user interests. This \textit{stochastic characteristics} brings up a solid demand to include uncertainties in representing sequences and items. Additionally, modeling sequences and items with uncertainties expands users' and items' interaction spaces, thus further alleviating cold-start problems. In this work, we propose a Distribution-based Transformer for Sequential Recommendation (DT4SR), which injects…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRecommender Systems and Techniques · Generative Adversarial Networks and Image Synthesis · Face recognition and analysis
MethodsAttention Is All You Need · Linear Layer · Absolute Position Encodings · Position-Wise Feed-Forward Layer · Byte Pair Encoding · Multi-Head Attention · Adam · Label Smoothing · Residual Connection · Dense Connections
