MaTrRec: Uniting Mamba and Transformer for Sequential Recommendation

Shun Zhang; Runsen Zhang; Zhirong Yang

arXiv:2407.19239·cs.IR·July 30, 2024

TL;DR

MaTrRec is a novel sequential recommendation model that combines Mamba and Transformer to effectively handle both long-term and short-term user preferences, improving performance especially in data-sparse scenarios.

Contribution

The paper introduces MaTrRec, a hybrid model that unites Mamba's efficiency with Transformer's global attention, enhancing recommendation accuracy across various sequence lengths.

Findings

1. Outperforms state-of-the-art models on five public datasets.
2. Significantly improves cold start performance by up to 33%.
3. Effectively captures both long-term and short-term dependencies.

Abstract

Sequential recommendation systems aim to provide personalized recommendations by analyzing dynamic preferences and dependencies within user behavior sequences. Transformer models have recently proven effective at capturing user preferences. However, their quadratic computational complexity limits recommendation performance on long interaction sequences. Mamba, a representative State Space Model (SSM), efficiently captures user preferences in long interaction sequences with linear complexity, but we find that its recommendation effectiveness is limited on short interaction sequences: it fails to recall items of actual interest to users and exacerbates the data sparsity and cold start problems. To address this issue, we propose a new model, MaTrRec, which combines the strengths of Mamba and Transformer. This model fully leverages Mamba's advantages in…
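The complexity contrast in the abstract can be made concrete with a toy comparison. The sketch below is not the MaTrRec implementation and is not Mamba's selective scan: it assumes a scalar, fixed-coefficient linear recurrence as a stand-in for a state-space layer and a scalar causal self-attention as a stand-in for a Transformer layer. The function names `ssm_scan` and `causal_attention` and the coefficients `a` and `b` are hypothetical, chosen only for illustration. It counts how many per-position interactions each approach needs as the sequence grows.

```python
import math

def ssm_scan(xs, a=0.9, b=0.1):
    """Toy linear recurrence h_t = a * h_{t-1} + b * x_t (assumed stand-in
    for a state-space layer; real Mamba uses input-dependent parameters)."""
    h, out, steps = 0.0, [], 0
    for x in xs:
        h = a * h + b * x      # one state update per position
        out.append(h)
        steps += 1
    return out, steps

def causal_attention(xs):
    """Toy scalar causal self-attention: position t attends to 0..t."""
    out, steps = [], 0
    for t, q in enumerate(xs):
        scores = [q * xs[j] for j in range(t + 1)]
        m = max(scores)                      # subtract max for stable softmax
        weights = [math.exp(s - m) for s in scores]
        z = sum(weights)
        out.append(sum(w * xs[j] for j, w in enumerate(weights)) / z)
        steps += t + 1                       # one interaction per attended position
    return out, steps

if __name__ == "__main__":
    for n in (10, 100, 1000):
        xs = [math.sin(i) for i in range(n)]
        _, scan_steps = ssm_scan(xs)
        _, attn_steps = causal_attention(xs)
        print(n, scan_steps, attn_steps)
```

The scan performs n updates for a sequence of length n, while the causal attention performs n(n+1)/2 pairwise interactions, which is the linear versus quadratic growth the abstract refers to. How MaTrRec actually arranges its Mamba and Transformer components is not stated in the text above, so this sketch deliberately does not combine them.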

Code & Models

Repositories

unintelligentmumu/matrrec (PyTorch, official)

Taxonomy

Topics: Recommender Systems and Techniques · Topic Modeling · Sentiment Analysis and Opinion Mining

Methods: Attention Is All You Need · Label Smoothing · Adam · Linear Layer · Byte Pair Encoding · Layer Normalization · Softmax · Position-Wise Feed-Forward Layer · Dense Connections · Multi-Head Attention