MaTrRec: Uniting Mamba and Transformer for Sequential Recommendation
Shun Zhang, Runsen Zhang, Zhirong Yang

TL;DR
MaTrRec is a novel sequential recommendation model that combines Mamba and Transformer to effectively handle both long-term and short-term user preferences, improving performance especially in data-sparse scenarios.
Contribution
The paper introduces MaTrRec, a hybrid model that unites Mamba's efficiency with Transformer’s global attention, enhancing recommendation accuracy across various sequence lengths.
Findings
Outperforms state-of-the-art models on five public datasets.
Significantly improves cold start performance by up to 33%.
Effectively captures both long-term and short-term dependencies.
Abstract
Sequential recommendation systems aim to provide personalized recommendations by analyzing dynamic preferences and dependencies within user behavior sequences. Recently, Transformer models can effectively capture user preferences. However, their quadratic computational complexity limits recommendation performance on long interaction sequence data. Inspired by the State Space Model (SSM)representative model, Mamba, which efficiently captures user preferences in long interaction sequences with linear complexity, we find that Mamba's recommendation effectiveness is limited in short interaction sequences, with failing to recall items of actual interest to users and exacerbating the data sparsity cold start problem. To address this issue, we innovatively propose a new model, MaTrRec, which combines the strengths of Mamba and Transformer. This model fully leverages Mamba's advantages in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRecommender Systems and Techniques · Topic Modeling · Sentiment Analysis and Opinion Mining
MethodsAttention Is All You Need · Label Smoothing · Adam · Linear Layer · Byte Pair Encoding · Layer Normalization · Softmax · Position-Wise Feed-Forward Layer · Dense Connections · Multi-Head Attention
