Mutual Wasserstein Discrepancy Minimization for Sequential Recommendation
Ziwei Fan, Zhiwei Liu, Hao Peng, Philip S Yu

TL;DR
This paper introduces MStein, a novel self-supervised sequential recommendation framework that uses Wasserstein Discrepancy Measurement to improve mutual information estimation, robustness, and training efficiency over traditional KL-based methods.
Contribution
It proposes a new mutual information measurement based on Wasserstein distance, addressing limitations of KL divergence and enhancing robustness and efficiency in sequential recommendation.
Findings
MStein outperforms baseline methods on benchmark datasets.
Demonstrates robustness against data perturbations.
Shows improved training efficiency with smaller batch sizes.
Abstract
Self-supervised sequential recommendation significantly improves recommendation performance by maximizing mutual information with well-designed data augmentations. However, the mutual information estimation is based on the calculation of Kullback Leibler divergence with several limitations, including asymmetrical estimation, the exponential need of the sample size, and training instability. Also, existing data augmentations are mostly stochastic and can potentially break sequential correlations with random modifications. These two issues motivate us to investigate an alternative robust mutual information measurement capable of modeling uncertainty and alleviating KL divergence limitations. To this end, we propose a novel self-supervised learning framework based on Mutual WasserStein discrepancy minimization MStein for the sequential recommendation. We propose the Wasserstein Discrepancy…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRecommender Systems and Techniques · Advanced Bandit Algorithms Research · Stochastic Gradient Optimization Techniques
MethodsContrastive Learning
