TL;DR
MISSRec introduces a multi-modal pre-training framework for sequential recommendation that leverages diverse user and item information to improve robustness, transferability, and performance in sparse and cold-start scenarios.
Contribution
The paper proposes a novel multi-modal pre-training and transfer learning framework, MISSRec, with a Transformer-based model and interest-aware modules for enhanced sequence representation in recommendation systems.
Findings
Effective in cold-start and sparse ID scenarios
Improves transferability across domains
Achieves superior recommendation accuracy
Abstract
The goal of sequential recommendation (SR) is to predict a user's potential interested items based on her/his historical interaction sequences. Most existing sequential recommenders are developed based on ID features, which, despite their widespread use, often underperform with sparse IDs and struggle with the cold-start problem. Besides, inconsistent ID mappings hinder the model's transferability, isolating similar recommendation domains that could have been co-optimized. This paper aims to address these issues by exploring the potential of multi-modal information in learning robust and generalizable sequence representations. We propose MISSRec, a multi-modal pre-training and transfer learning framework for SR. On the user side, we design a Transformer-based encoder-decoder model, where the contextual encoder learns to capture the sequence-level multi-modal user interests while a novel…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsContrastive Learning
